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Editorial 

Peggy Johnson 

Thomas Frey, the executive director and senior futurist at the 
DaVinci Institute, has identified ten trends that he believes are 
affecting the development of libraries. 1 I share a synopsis with 
you to encourage thought about how these trends will likely 
influence what you do in your work and how you plan for the 
future. Some of the trends Frey identifies might serve as start- 
ing points for thoughtful papers about die future of technical 
services. Does anydiing here speak to you? 

1. Communication systems are continually changing the way people access 
information. 

2. All technology ends. All technologies commonly used today will be replaced 
by something new. Frey suggests that writing and books are technologies 
with limited life spans. 

3. We have not yet reached the ultimate small particle for storage — but will 
soon. Standards for information storage will become more critical as stor- 
age particles become smaller, and the most critical component of stabilizing 
information storage will surround the issues of findability. 

4. Search technology will become increasingly more complicated. As search- 
ing technology expands to include the ability to search for such attributes 
as taste, smell, texture, reflectivity, opacity, mass, density, tone, speed, and 
volume, the role of the librarian becomes more and more important. 

5. Time compression is changing the lifestyle of library patrons. It is affecting 
nearly every aspect of our lives. As human need grows, the opportunities for 
libraries to meet these needs is also growing. The library of the future needs 
to be designed to accommodate the changing needs of its constituency. 

6. We will be transitioning to a verbal society. Keyboards remain as die primary 
interface between people and electronic information, yet die days of die key- 
board are numbered. There will be a strong trend toward verbal information. 

7. The demand for global information is growing exponentially. 

8. The stage is being set for a new era of global systems. Our ability to learn 
about and understand the cultures of the world is central to the global 
society of the future. Libraries will play a central role in developing global 
systems because libraries will be charged with archiving and disseminating 
the information necessary for the new systems to flourish. 

9. We are transitioning from a product-based economy to an experience-based 
economy. Books will transition from a product to an experience. As books 
change from words on a page to digital manifestations of information, they will 
be reviewed and evaluated by die experience they create. How patrons experi- 
ence libraries, as well as books, will become a key measurement criteria. 

10. Libraries will transition from a center of information to a center of culture. 
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From the Ubiquitous to 
the Nonexistent 

A Demographic Study of OCLC 
WorldCat 

Jay H. Bernstein 



Analysis of a random sample of bibliographic records from OCLC WorldCat finds 
that the great majority of items in WorldCat are held by very few participating 
libraries, and that an inverse geometric relationship exists between the number of 
libraries holding an item and the number of items with a given level of shared hold- 
ings. The findings provide a context for interpreting holding levels in WorldCat 
with regard to the proportion of widely shared items and the characteristics of 
items at various ranges of holdings. Used with other quantitative and evaluative 
measures, these findings will assist libraries in assessing their collections. 

OCLC WorldCat is arguably one of the most valuable tools available to librar- 
ians, as it provides online, global access to the shared records of thousands 
of libraries around the world. This enables librarians, at a minimum, to verify the 
existence of an item mentioned by a patron and to provide correct bibliographical 
facts of publication for it. More generally, it helps librarians identify and locate 
items that may not be held in their own collections. 

WorldCat is also a potentially powerful research tool for collection analysis 
because each of its bibliographic records indicates the number of member librar- 
ies holding that item. These data are tantalizingly provocative to scholars inter- 
ested in patterns in the distribution of books, knowledge, and information. 

The theoretical basis for considering WorldCat holding levels in library 
research is that the number of libraries holding a given title provides a score 
that can measure that title's influence or impact. Several studies conducted over 
the years on specific library material domains — adult fiction, books from small 
publishers, scientific journals, and award-winning monographic titles in the 
social sciences and humanities — have cited figures on holdings in WorldCat or 
its predecessor, the OCLC Online Union Catalog. 1 Similarly, Mirwis has included 
WorldCat holdings as one of several factors used to rate and rank encyclope- 
dias. 2 

The difficulty with citing these holdings data lies in knowing how to interpret 
them. Without a clear picture of what exactly is in WorldCat and what the spec- 
trum of distribution is, the raw holdings numbers mean little. Only by examining 
a sample of items from WorldCat itself, rather than beginning with particular 
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known items or a particular library collection, can one 
develop an interpretive framework that could maximize the 
analytical value of these data. 

In the interest of providing a context for interpreting 
WorldCat holding levels, I undertook what might be called 
a demographic study of WorldCat, focusing on its composi- 
tion and major characteristics as a population by analyzing a 
random sample of 500 records. The goal in investigating the 
items of WorldCat as representatives of a population was to 
determine the categories of materials that are represented 
in WorldCat and their proportions, the proportion of widely 
held items, and the characteristics of those items. 

Precedents for this study may be found in publications 
by White and Perrault. 3 White demonstrated the connec- 
tion between Research Libraries Group (RLG) Conspectus 
levels and OCLC holding levels. He found that titles at the 
research level had holding levels up to 150, and that the 
midpoint of holdings of titles in library collections at all 
levels was about 400. 4 Perrault studied a systematic random 
sample of nearly 3.4 million WorldCat records to gauge 
trends in library collection building over time and assess 
WorldCat's potential for promoting shared access to scarce 
resources. 3 While her study did not concern levels of shared 
holding in a manner comparable to White's work, she did 
uncover a finding that is striking when considered along- 
side White's results: more than 53 percent of the records in 
Perrault 's sample were held by only one member library. 6 

The present study attempts to show the big picture 
of what is in WorldCat by considering its widely shared 
items against the background of a much larger number of 
extremely scarce items. Like Perrault's study, it employs an 
inductive approach to assess WorldCat as a whole, though 
it is based on a sample far smaller than hers. Like White's 
work, it relates its findings to those items in WorldCat that 
are frequently found in library collections. This study finds 
that 39.8 percent of items in WorldCat are held by only 
one library This is significantly less than the 53.1 percent 
found by Perrault, though it is still a very considerable frac- 
tion. The study also finds that only 9.1 percent of items are 
held by more than 50 participating libraries. Perhaps most 
surprising, the study shows that 1.2 percent of items in 
WorldCat are not held by any libraries. 

OCLC WorldCat in a Nutshell 

Provided by OCLC Online Computer Library Center, 
WorldCat is an online bibliographic database containing 
records created and maintained collectively by more than 
9,000 member institutions (the exact number is unavailable). 
Described by OCLC as "the largest and most comprehen- 
sive database of its kind," its resources "span thousands of 
years and nearly every form of human expression. Records 



exist for everything from stone tablets to electronic books, 
wax recordings to MP3s, DVDs and Web sites.'" A million 
records were added in the first three-and-a-half months of 
2004, bringing the total inventory to 55 million. On average, 
WorldCat adds a new record every 12 seconds, a phenom- 
enally rapid rate of growth. 8 The database has doubled in 
size in just more than eleven years. 9 

Not long after launching its Online Union Catalog 
in 1971, OCLC set forth initiatives throughout the world 
promoting input from other countries that would make 
its database global. In doing so, OCLC had to incorporate 
international variations in MARC format, authority rules, 
record quality, and library traditions, not to mention non- 
Roman alphabets and characters. 10 

The many libraries around the world contributing to 
WorldCat make it an unsurpassed data bank for research on 
the contents of libraries. It is probably a good measure of 
library holdings, especially in the English-speaking world. 
However, this does not necessarily mean that it accurately 
reflects the totality of world library holdings, much less the 
entire store of recorded human knowledge. 

Most institutions that contribute to WorldCat are gov- 
erning members of OCLC, meaning that they contribute 
all current cataloging to an OCLC-affiliated database. Two 
lower grades of OCLC membership involve less-than-full 
participation in WorldCat. More than 77 percent of govern- 
ing members are located in the United States. WorldCat 
seems to represent academic libraries more so than other 
kinds of library. According to data in OCLC's 2003 annual 
report, 31 percent of governing member libraries are col- 
lege and university libraries, compared to 22.2 percent of 
governing members that are public libraries, 12.8 percent 
that are federal, state, or municipal government libraries, 
and 10.2 percent that are corporate or business libraries. 11 
Other categories of libraries making up OCLC's governing 
membership are community college and vocational (8.3 
percent), school (7.4 percent), associations and foundations 
(4.0 percent), state and national (1.0 percent), and other 
(3.0 percent). 12 

Data Collection Procedures 

The first step in data collection was to select 500 random 
numbers from 1 to 54 million using an online random num- 
ber generating service. 13 The maximum number of 54 mil- 
lion was chosen because the database contained "54 million 
quality records and counting" at the time research for the 
study began in July 2004, according to OCLC's WorldCat 
home page. 14 Each randomly chosen number obtained using 
this method was matched with the bibliographic record for 
the item with that OCLC record number. Given the size of 
the chosen population, the sample of 500 provides a confi- 
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dence level of 95 percent and a confidence interval of 4.4. 
Therefore, conclusions about this sample are expected to be 
accurate plus or minus 4.4 percent, with 95 percent certain- 
ty, as statements about the entire WorldCat database. Such 
a sample is adequate for drawing general conclusions about 
the major outlines and proportions of items in WorldCat. It 
is not sufficiently large enough to include many outstanding 
but extremely rare types of items, such as incunabula, paleo- 
graphic writing boards, or items in hieroglyphics. 

Findings 

Types of Documents and Materials 

In order to comprehend WorldCat as a database, one must 
sort out the various kinds of items it contains. The Anglo- 
American Cataloguing Rules, 2nd edition, 2002 revision, 
differentiates in its table of contents the following catego- 
ries of items for cataloging: books, pamphlets, and printed 
sheets; cartographic materials; manuscripts (including man- 
uscript collections); music [printed notated music]; sound 
recordings; motion pictures and videorecordings; graphic 
materials; electronic resources; three-dimensional artifacts 
and realia; microforms; continuing resources [serials]; and 
analysis [monographic or journal analytics]. 15 

Such a synopsis is typical. However, it is inadequate as a 
categorical scheme, as tire individual categories are not mutu- 
ally exclusive. Books, for example, can be manifested not only 
on printed paper but also in sound recordings, microforms, 
and electronic resources. Indeed, most document types can 
appear in a number of material formats. To avoid confusion 
in collection analysis, a system that creates nonoverlapping 
categories by distinguishing explicitly between document 
type and material type must be established. 

The use of the terms document type and material type 
is intended to emphasize a sharp conceptual distinction that 
is absent from most discussions of the categories of items in 
catalogs and bibliographic databases. Without this distinc- 
tion, any discussion of the categories of items in libraries and 
catalogs is bound to be muddled. 

Document type is primary, and material type is subsid- 
iary. Document type sums up the general, overall character 
of a work in a basic, salient, overriding category. The major 
document types, such as book, serial, manuscript, sound 
recording, and motion picture, are sociocultural categories 
characterized by prototypical cores but fuzzy and ambigu- 
ous boundaries. 16 Material type denotes not only the physi- 
cal medium (e.g., paper, magnetic tape, celluloid film), but 
also the mode of communication — the medium in which 
the content is presented. Thus, printed language material, 
printed cartographic material, and printed musical mate- 
rial are distinct material types. Similarly, atlases and scores 
on microfilm are separate material type categories from 



microfilm books. Material type differentiates among various 
specific microformats (e.g., microfilm, microfiche, micro- 
opaque); among formats for reproducing sound, visual, or 
graphical data; and among various electronic formats and 
means of access to computer files (Web, file transfer proto- 
col, Usenet, diskette, CD-ROM, and so on.). 

Analyzing the makeup of OCLC WorldCat in terms 
of document type and material type involves both the 
aggregation and separation of categories. Thirteen docu- 
ment types and 22 material types may be identified in the 
sample. Combining document types and material types, 
bibliographic records from the sample of 500 can be placed 
into 30 mutually exclusive categories, plus a null category 
consisting of 5 numbers matching no records. Items are 
by no means distributed evenly among categories, but are 
heavily clustered in a few categories, with a few scattered 
entries among all the other categories (see table 1). Printed 
language books (hereafter print books) alone account for 
more than two-thirds of all items. 

For both document type and material type, the pre- 
dominating type is responsible for more than 70 percent of 
records and is about ten times more prevalent than the sec- 
ond most common type. The 2 leading types in both catego- 
ries are closely related to each other: the leading document 



Table 1 . Document types and material types in WorldCat 
sample (n=495) 



Document type No. % 

Books 364 73.5 

Manuscripts (text) 40 8.1 

Serials 25 5.1 

Scores 15 3.0 

Musical recordings 12 2.4 

All others 39 7.9 
Material type 

Printed language 355 71.7 

Original language 36 7.3 

Microfilm language 2 1 4.2 

Printed notated music 14 2.8 

Microfiche language 10 2.0 

All others 59 11.9 

Combined document: material type 

Books: Printed language 335 67.6 

Manuscripts: Original language 33 6.7 

Serials: Printed language 18 3.6 

Books: Microfilm language 15 3.0 

Scores: Printed notated music 14 2.8 

All others 80 16.2 
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type is the book, and the leading material type is printed 
language material, while in second place are manuscript and 
original language material respectively. The same ratio holds 
for the 2 leading combined document-material types: there 
are ten times as many print books as original language mate- 
rial manuscripts. Together, these 2 kinds of items account 
for almost 75 percent of all items in the database. 

Categories of Holdings Levels 

This study aims to determine the proportion of items in the 
catalog that are widely held and to analyze the characteris- 
tics of these widely held items. Determining a cut-off point 
for high holdings to define a category of widely held items 
is an arbitrary procedure that can lead to circular reasoning. 
One cannot know in advance what a high level of holdings 
is. Because this paper is a first attempt to determine the 
occurrence of widely held items in context of the totality 
of merged online catalogs, predefining the category seems 
unwise. It is somewhat easier on an intuitive level to grasp 
the meaning of scarce items than common items (though 
here too the cut-off point is arbitrary). Therefore, rather 
than attempt to provide a parameter for widely held items, I 
will set these items aside for the time being by dividing the 
sample into four categories: 

• Nonexistent: items with holdings 

• Unique: items with 1 holding 

• Scarce: items with 2 to 50 holdings 

• Non-scarce: items with more than 50 holdings 

In this typology, non-scarce is a residual category and will be 
addressed later in this paper. 

The reader may be surprised at these categories, par- 
ticularly die first. However, die typology arises from the data 
themselves, so the distribution of holding levels itself is the 
real surprise: 



Nonexistent items: 6 (1.2 percent) 
Unique items: 197 (39.8 percent) 
Scarce items: 247 (49.9 percent) 
Non-scarce items: 45 (9.1 
percent) 



tribution for all items and for the top five document types, 
which make up more than 92 percent of the sample. 
Government publications, which account for 7.3 percent of 
the sample, break down in approximately the same propor- 
tions as other items. 

Based on the sample, 1.2 percent of items cataloged 
in WorldCat have absolutely no holdings; they are not held 
by the libraries that had cataloged the items and input die 
records. These appear to be items that once were held but 
that were deaccessioned, as none of the records are based 
on prepublication data or unexamined material. However, 
the bibliographic records that should have been deleted or 
reported as errors persist, and the items must be considered 
nonexistent in terms of library collections. For 5 of the 6 
items with holdings, other records with holdings for items 
with the same title are present; all items are extremely 
scarce, except for one that is cataloged in the sample as a 
book but that is held as a microform serial by 17 libraries. 
In addition to the 6 records that have holdings, 5 numbers 
used to obtain sample records do not match any records. 
The percentages given here are percentages of the sample 
of 495 records, not of all the 500 random numbers used to 
generate the sample, unless otherwise noted. 

In this paper, unique means that only one library that 
is a participating member of OCLC owns the item. Unique 
does not necessarily mean that no other library in the world 
owns the item, much less that it is an absolutely unique doc- 
ument. In analyzing die percent of unique items, the differ- 
entiation of original or archival materials, which one would 
expect to be unique from other categories of materials is 
advantageous. Original and archival materials in the sample 
include language and music manuscripts, mixed material 
collections, and a graphic collection. Forty-four items in 
the sample are archival, and 41 of these are unique. All but 
one of the language manuscripts are academic theses, and 
the exception appears to be an incorrectly cataloged book. 
The great majority of theses are for degrees from United 
States institutions and are in English. Non-archival materials 
account for 451 items in the sample and, of these, 156 (34.6 
percent) are unique. Archival materials account for only 8.9 
percent of all materials in the sample but 20.9 percent of die 



Table 2. Distribution of holding levels categories by document type (n=495) 



Non-scarce items account 
for a relatively small fraction of 
the sample, and this finding jus- 
tifies not subdividing the non- 
scarce portion into thinner layers 
at the outset. 

These distributions vary by 
both document type and mate- 
rial type. Table 2 shows the dis- 



Document type 


Nonexistent 


Unique 


Scarce 


Non-scarce 


Total 


Books 


4 


118 


200 


42 


364 


Manuscripts 





36 


4 





40 


Serials 





11 


13 


1 


25 


Scores 





7 


8 





15 


Musical recordings 


1 


3 


7 


1 


12 


All others 


1 


22 


15 


1 


39 


Total 


6 


197 


247 


45 


495 
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unique content. Besides archival materials, the category of 
unique items includes many of the items outside the print 
book category as well as many technical reports and items of 
only local significance. 

Table 3 shows the distribution of frequency by material 
type categories. It contrasts print language material, which is 
the predominant category, making up almost three-quarters 
of the total, against all archival materials and the residual 
category of all other materials, which includes microforms, 
computer files, maps, sound recordings, and so on. 

Unique items are more likely than items in other hold- 
ing level categories to be short in length. Of unique print 
books, 36.2 percent have fewer than 50 pages (or leaves), 
and 40.9 percent are more than 100 pages long. This com- 
pares to 27.2 percent and 56.1 percent, respectively, in all 
holding level categories. Table 4 shows the distribution of 
print books in each holding level category according to 
page length. 

In a study of nearly 3.4 million items in WorldCat, 
Perrault found that 53.1 percent of all records for mono- 
graphs were unique. 1 ' Presumably, Perrault's monographs 
are identical to my print books, though the term also could 
refer to anything not cataloged as a serial. Only 39.8 percent 
of all items in the present research sample, including 31.3 
percent of print books, are unique. 

Scarce is defined as having greater than 1 but no more 
than 50 holdings; 49.9 percent of items are in the scarce cat- 
egory. Although a few items cataloged as archival materials 
have multiple holdings in very low numbers and thus may be 
considered scarce, most scarce items are non-archival. 

Holding levels are strongly correlated with the pres- 
ence of call numbers. One hundred fifty-eight records 



Table 3. Distribution of holding levels categories by material type (n=495) 



Material type 


Nonexistent 


Unique 


Scarce 


Non-scarce 


Total 


Printed language 


4 


118 


197 


40 


359 


Archival 





41 


3 





44 


Other 


2 


38 


47 


5 


92 


Total 


6 


197 


247 


45 


495 




Table 4. Length in pages of print books categorized by holdings level category 


Holdings 


1-50 




More than 


Unknown page 




level 


pages 


51-100 pages 


1 00 pages 


length 


Total 


Nonexistent 


2 





l 


1 


4 


Unique 


38 


14 


43 


10 


105 


Scarce 


50 


21 


109 


7 


187 


Non-scarce 


1 


2 


35 


1 


39 


Total 


91 


37 


188 


19 


335 



(31.9 percent of the total) have no call numbers of any 
kind, but only 2 are non-scarce. Of unique items, 46.2 
percent lack call numbers (very close to the 47.6 percent 
of unique monographic titles with no call number found by 
Perrault), and for scarce items, the figure is 25.5 percent. 18 
Half of all items have a Library of Congress Classification 
(LCC) number assigned either locally or by the Library of 
Congress (LC). Adding National Library of Canada num- 
bers, 52.5 percent of items have at least one LCC-type call 
number. Dewey Decimal Classification (DDC) numbers, 
either LC-assigned or locally assigned, are present in 30.7 
percent of items. For all items, the ratio of LC-assigned to 
locally assigned LCC numbers is approximately 1:1, but 
for DDC numbers the ratio is greater than 2 : 1 (see table 
5). For unique items, however, locally assigned numbers 
greatly outnumber centrally assigned numbers in both 
classification systems. 

Whereas scarce items frequently have incomplete cata- 
loging, the records for non-scarce items tend to be the full- 
est, most complete records. They are far more likely than 
others to have been cataloged at the national level, meaning 
they have been both cataloged and transcribed by national 
bibliographic agencies, as indicated in the MARC 040 field 
(cataloging source) by codes for national cataloging agen- 
cies — the Library of Congress (DLC), the British Library 
(UKM), the National Library of Canada (NLC), the National 
Library of Medicine (NLM), and others. For example, an 
item cataloged and transcribed by LC would contain the 
code "DLC |c DLC" in the MARC 040 field. Compared to 
18.6 percent of scarce titles and just 7.7 percent of unique 
titles, 58.7 percent of non-scarce titles are so cataloged. 
For titles with holdings of more than 100, the propor- 
tion is 81.8 percent. Excluding 
archival materials, 20 percent 
of records are cataloged and 
transcribed by national biblio- 
graphic agencies. Print books 
account for 90.9 percent of 
items cataloged at the national 
level. Table 5 shows that call 
numbers for non-scarce items 
are far more often created by 
LC than locally; 78 percent of 
items have LC-created num- 
bers (LCC or DDC), as com- 
pared to 9 percent with locally 
created numbers. Non-scarce 
items often have specialized call 
numbers in addition to ordinary 
call numbers. Reflecting the 
dominance of Anglo-American 
institutions in WorldCat par- 
ticipation, Universal Decimal 
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Table 5. Distribution of types of call numbers in frequency categories 
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Classification (MARC 080 field) or "other" (MARC 084 
field) numbers are found only in unique items. 

Non-scarce items make up only 9.1 percent of the total; 
93 percent of these are in the book document type, and 89 
percent are print books. Besides print books, non-scarce 
items include 2 microfilm reproductions of original books 
dating from 1665 and 1894, a map, a musical recording on a 
compact disc, and an online journal. Considering that these 
document and material types occur far less frequently than 
print books, one cannot conclude from the low numbers 
that items of these kinds are scarce in greater proportions 
than are print books. 

Countries and Regions of Origin 

Given OCLC's stated objective of the global networking 
of information, United States dominance as die place of 
origin of items in WorldCat (despite the representation of 
many countries in all regions of the world) is noteworthy. 19 
Items originate in 54 countries, an impressive number for 
a sample this size. Twenty-four countries are represented 
by one item apiece, and 19 countries by between 2 and 5 
items. The United States alone is responsible for 195 items, 
or 39.4 percent. 

In second place, with 60 items, or 12.1 percent, are 
materials with no place or an unknown or undetermined 
place of origin, as indicated by "Ctry: xx" in the fixed field 
of the MARC record. Most of these are archival or unpub- 
lished materials, especially manuscripts, which as a rule 
do not name a country of origin. However, odier items for 
which the place of publication or origin is not recorded or 
cannot be determined are coded the same way. Apart from 
manuscripts (language and music), 4.2 percent of records do 
not name a country of origin. 

Limiting the materials to print books, 52 countries 
are represented in all (see figure 1). Twenty-three of these 
are responsible for only 1 book each, and another 16 are 
responsible for between 2 and 5 books each. The United 
States, England, and 8 other countries (mostly in Western 
Europe) are responsible for more than three-quarters of all 
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Figure 1. Distribution of print books in WorldCat sample by 
country of publication (n=335) 
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Figure 2. Distribution of items in WorldCat sample by region of 
origin (n=495) 



print books in the sample. In the absence of manuscripts, 
archival materials, and computer files, only 3.0 percent 
have no known or determined country of origin, as com- 
pared to 30.6 percent in all other categories. 
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Looking at WorldCat in terms of regional representa- 
tion (see figure 2), North America (defined as die United 
States and Canada), Western Europe, and the British Isles 
(including the Republic of Ireland) are dominant, while 
Africa and all of Asia except for the Far East are only very 
slightly represented. 

Microfilm and other reproductions usually state the 
country of origin of die original document, not the copy, at 
least in the fixed field of the MARC record, which is the 
preferred source of information on countries for this study. 
Bearing in mind diis proviso, 31 of 45 non-scarce items (69 
percent) originated in die United States, followed by eight 
(18 percent) from England; die remaining items (1 each) 
originated in Canada, France, Germany, Japan, Spain, and 
Scotland. All nonexistent items originated the United States. 

Languages 

Official information on the number of items in various 
languages in WorldCat, as of July 30, 2004, is available on 
OCLC's Internet page, "WorldCat facts and statistics." 20 
Based on figures for the total number of items in the data- 
base presented therein, one can calculate the fraction of 
items in each language. Comparison of these statistics to 
those for the top 9 languages in the research sample (see 
table 6) lends assurance that die sample is representative 
of the whole, though noticeable differences exist concern- 
ing the proportions of items in French and Japanese. The 
reader is referred to the Web site for figures for the top 53 
languages (Serbo-Croatian is counted twice, differentiating 
between Romanic and Cyrillic forms). The sample of 495 
includes items in 31 languages (including both forms of 
Serbo-Croatian) and 2 items in multiple languages. Three 
percent of the sample, chiefly musical scores and record- 
ings, are nonlingual. Of the 27 items in those categories, 13 
(48 percent) have no language. 

The last column of table 6 provides percentages for 
print books. It shows that the percentage of print books 
in English is significantly smaller than the percentage of 
all items in English while the percentages in French and 
German are considerably higher for print books than for all 
formats. This appears to be the effect of the large number 
of records contributed by academic libraries in the United 
States and other Anglophone countries for academic theses 
that are overwhelmingly in English and excluded from the 
print book category as these are cataloged as manuscripts. 

Intensity of language representation and holding levels 
are strongly correlated, with the most frequently occurring 
languages having the highest holdings. The 305 English- 
language items can be divided into the frequency catego- 
ries as follows: 6 (2 percent) nonexistent, 127 (42 percent) 
unique, 130 (43 percent) scarce, and 42 (14 percent) 
non-scarce. By comparison, of 190 items not in English, 



69 (36 percent) are unique, 117 (62 percent) are scarce, 
and only 4 (2 percent) are non-scarce, including titles in 
French, German, and Spanish, as well as one nonlingual 
musical recording. 

Material Age 

To analyze the age structure of WorldCat, all serial or con- 
tinuing items as well as undated items are removed from 
the sample, resulting in 460 items. The sample is divided by 
century, further breaking down the twentieth century into 
time periods for the first half, third quarter, and fourth quar- 
ter. Separate categories are created for items diat have been 
reproduced in periods different from the original period of 
publication. The results appear- in table 7. The period of 1976 
through 2000 is the largest category, with 208 items, which is 
45.2 percent of dated, non-serial and non-continuing items 
in the sample. This time period accounts for 69.6 percent of 
non-scarce items. 

Holdings Distribution 

The simplest description of the holdings distribution of 
sample records is that the greater the number of holdings, 



Table 6. Representation of languages in WorldCat and in 
research sample by percentage 
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Table 7. Date of publication of non-serial and non-continuing 


items (n=460) 
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the fewer the number of items having that many holdings. 
There are many records with low holdings, with the greatest 
number having a single holding, and decreasing numbers of 
records with higher numbers of holdings, followed by a few 
records with a large number of holdings. The first part of the 
curve can be seen as a reverse J-shaped curve with a long 
tail, as shown in figure 3, which provides the frequency of 
records with 1 to 40 holdings. But to say there is a long tail 
is insufficient, as this does not begin to suggest the relatively 
small but significant number of records that have much 
higher levels of holding. 

Although any cut-off point is arbitrary, one may visual- 
ize the sample in two sections, as in figures 4 and 5. The 
first group is the large number of items with few holdings, 
and the second is the group with few members, but with 
elevated numbers of holdings. Two-thirds of all the records 
(328 out of 495) have to 5 holdings, while the remaining 
one-third have more than 5 holdings. These categories are 
analyzed separately. 

For the first group, one can apply a modified version of 
Lotka's law, which states that "the number of authors who 
have published a specific number of papers [is] approxi- 
mately equal to die inverse square of that number mul- 
tiplied by the number of authors who had published one 
paper only, that is, [if one sets] f (y) as the number of authors 
publishing y papers, . . . [then] f (y)-!/)/ 2 x f (l)." 21 One can 
adapt this law by substituting records for authors and hold- 
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Figure 3. Frequency of records with to 40 holdings (n=445) 
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ings for papers. Given the size of the sample, this formula 
provides a reasonably close approximation for the fraction of 
items with 2 to 5 holdings (see figure 6). 

For the second group, with elevated numbers of hold- 
ings, a different form of analysis is necessary. Here, one 
may divide the group into zones containing equal numbers 
of records to see what the range of holding levels are for 
each zone. Keeping in mind the geometric growth seen 
in the curve in figure 5, and following Price's law, which 
states that "die number of prolific audrors in a subject area 
(i.e., producing about half the publications in the field) [is] 
approximately equal to the square root of the total number 
of authors," a zone is defined as a group containing a num- 
ber of items equal to the square root of the entire sample 
(500), which is slightly more than 22. 22 Not all zones contain 
exactly 22 items, as the size is adjusted to fit all items with 
the same number of holdings. In all, there are 8 zones in 
all of the items with holdings of more than four, as seen in 
table 8. Unexpectedly, zone 1, the highest zone, defined as 
the zone with the 22 items with the highest level of holdings, 
consists of items (all print books) with more than 100 hold- 
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Figure 5. Occurrence of records with 18 or more holdings 
(n=78) 




Figure 4. Frequency of records with to 1 7 holdings (n=41 7) 



Figure 6. Distribution of items with 1 to 5 holdings (n=287) 
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ings. Zone 2 contains items with 52 to 100 holdings; zone 
3 contains items with 25 to 51 holdings; and so on. In all 
zones, die majority of items are print books. 

Table 8 shows the range of holdings levels in each 
zone, a range that becomes wider as the number of items 
increases. This analysis of zones helps segment the large and 
heretofore undifferentiated category of scarce items. 

The number of widely shared items in OCLC WorldCat 
as a whole cannot be estimated from the square root of total 
number of items in the database. Given a database size 
of 54 million, this would mean that only 7,550 items have 
more than 100 holdings, an absurdly small number. On the 
contrary, the fraction of items in the sample with more than 
100 holdings, 4.4 percent, is a much more reliable figure for 
projecting the number of items with more than 100. Using 
this percentage, one can estimate 2,376,000 records with 
more than 100 in a database of 54 million. The modified 
equivalent of Price's law is used simply to suggest that the 
number of items with more than 100 holdings is approxi- 
mately the same as the number with 5 to 6 holdings, 7 to 
8 holdings, and so on, up to about 52 to 100 holdings. The 
highest zone is limited only by the number of participat- 
ing libraries; the second-highest zone has an interval of 49, 
and the third-highest zone has an interval of 27. As zones 
encompass materials with higher holdings, intervals of 
ranges grow exponentially. Excluding the first zone, whose 
wide range dwarfs all other zones, figure 7 graphs the inter- 
nal sizes for 8 to 2. 

Characteristics of Widely Held Titles 

Forty-five items in the sample (9.1 percent) are non-scarce, 
with more than 50 holdings each. However, the previous 
analysis shows that this category constitutes quite neatly 
the top two zones, with a residue of just 1 item in the third 
zone. At this point one can focus on the highest zone, which 
consists entirely of print books in English, dividing it even 
further into two equal halves to arrive at a core consisting of 
the most ubiquitous items. Dividing the top zone into two 
halves of 11 items each results in group A, with 381 to 1491 
holdings, followed by group B, widi 105 to 335 holdings. 
The size of the sample does not enable pinpointing with 
precision the lower boundary of this highest fraction, and 
one ends up widi a gap of 46 between the two sub-zones. 
Given a margin of error of 4.4 percent, one should prob- 
ably say only that highly ubiquitous holdings begin at about 
360, plus or minus 40. With 11 items, group A makes up 2.2 
percent of the sample. Remarkably, this is the same percent- 
age that has either no matching record or a record with no 
attached holdings. Even more striking is that White, in a 
study of library collections, found the midpoint for library 
holdings to be about 400; furthermore, he asserts that this 
figure of 400 library holdings represents the dividing line 
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Figure 7. Increasing interval sizes of zones 8 to 2 



between highly specialized books and books aimed at more 
general readership. 23 This suggests that fewer than 5 per- 
cent of items in WorldCat account for die great majority of 
library holdings. 

A look at die items in groups A and B supports White's 
statement. All items in group A are published in the 
United States. Five are works of fiction. One is a transla- 
tion. Publication dates range from 1954 to 2003. Publishers 
include industry giants — Warner Books, Houghton Mifflin, 
Viking, Basic Books, and St. Martin's (with two entries 
in this group) — along with a major scholarly publisher, 
Princeton University Press, as well as Orchard Books, a divi- 
sion of the Watts Publishing Group. The occurrence of two 
less-prominent publishers of group A books, the Center for 
Urban Policy Research at Rutgers University and John F. 
Blair (located in Winston-Salem, North Carolina), suggests 
that many relatively small publishers have at least some titles 
with high levels of holdings. 

Most group B books seem to be located in a dif- 
ferent sector of the book industry than group A books. 
Publication dates range from 1972 to 2002. Three of the 11 
titles are published in England. This group includes 1 title 
each in the fiction, juvenile, and government publication 
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categories. In order of descending ubiquity, publishers of 
group B titles include Routledge, W. B. Saunders, Prentice- 
Hall, World Publishers, Kar-Ben Copies, Scholar Press/ 
Ashgate, CRC Press, Overlook Press, Oneworld, Paulist 
Press, and the United States Government Printing Office. 
Most of these publishers are not in the trade or mass-mar- 
ket sectors of the publishing industry, but are dedicated to 
educational and scientific subjects. 

Changes in Holdings over Time 

By marking changes in the holdings of WorldCat records 
after a lapse of time, one can detect activity in the database. 
Ideally, one would like to know whether items circulated or 
were consulted, but WorldCat does not provide that infor- 
mation. Items whose holding levels have changed (upwards 
or downwards) over time are active; the others are static. 
As one might expect, items with the most holdings are the 
most active, while those with the fewest holdings are the 
least active. Surprisingly, however, items at all holding levels 
except nonexistent experience change over time. 

Five months after first obtaining data on holding levels, 
each record in the sample was checked. Forty-six items (9.3 
percent of the sample) had added holdings between July 
and December of 2004, while 24 (4.8 percent) had lost hold- 
ings during the same period of time. The largest number of 
holdings gained by any item was 91, while the largest num- 
ber of holdings lost was 8. 

Group A experienced the most marked change, with 
5 items gaining and 5 losing. Only 1 item in this group did 
not change. In group B, 4 items gained and 3 lost. In the 
non-scarce category as a whole, 15 items gained in holdings, 
while 16 lost holdings. Therefore, 69 percent of non-scarce 
items changed in the absolute number of holdings in a five- 
month period. All but 3 of these items were print books. 

Thirty items with scarce levels of holdings gained, while 
8 lost. This means that 15.4 percent of scarce items changed 
in the number of libraries holding the item. Twenty-six of 
the scarce items that gained in holdings were print books. 
Only 2 unique items gained in holdings during the course 
of the study. While these items are exceptional, the study 
finds that activity is not exceptional at the level of 2 holdings. 
Indeed, 3 items that were not unique at the beginning of the 
study became unique by the end of the study by being de- 
accessioned from libraries that previously owned them. This 
study suggests that unique items are occasionally removed 
from holding libraries without deleting the records, result- 
ing in nonexistent items. 

Widely Shared Titles and Collecting Levels 

This investigation finds that most items in WorldCat are 
rare and unusual items that do not have a place in most 



library collections and are unavailable for acquisition 
by libraries other than those that already own them. To 
use an oceanographic metaphor, most known content in 
WorldCat consists of the small fraction of materials at or 
near the surface, while a far greater amount of content is 
unknown, lying undisturbed deep below the surface and 
toward the ocean floor. The majority of items in WorldCat 
are extremely scarce, with two-thirds held by no more 
than 5 participating libraries. Even excluding manuscript 
and other archival materials (which make up a consider- 
able portion of WorldCat), nearly 75 percent of items are 
held by a maximum of ten participating libraries. Although 
extremely scarce items are sometimes added to library col- 
lections, only a small fraction of WorldCat items are widely 
distributed among libraries and play a role in selection, 
acquisition, and collection management. 

White has suggested that OCLC (i.e., WorldCat) hold- 
ings data can be used as a guide for collection develop- 
ment by checking a sample of items against the number of 
holdings in WorldCat to determine those items' collecting 
levels in terms of the RLG Conspectus. 24 The Conspectus 
provides guidelines for the acquisition of library materials, 
differentiating between minimal, basic information; study or 
instructional support; research; and comprehensive levels. 
These levels refer to libraries' objectives for specific call 
number ranges and subject collections, not whole libraries, 
and it should be noted that some levels have gradations and 
qualifications. 25 White developed what he called rules of 
thumb for determining the collecting level in these terms 
of given items. He reckoned that books with fewer than 150 
holdings were at the research level; those with between 150 
and 400 holdings were at the instructional level; those with 
between 400 and 750 holdings were at the basic information 
level; and those with more than 750 holdings were at the 
minimal level. 26 

From 2001 through 2002, Lesniaski reexamined the 
holding counts of items in White's sample lists, finding that 
the number of holdings for titles in each collecting level 
had risen significantly. 2 ' By adjusting each collecting level 
upward, he found that the approximate ratios between levels 
remained approximately constant. By Lesniaski's new rules 
of thumb, level 1 (minimal level) corresponds to WorldCat 
holdings above 1,000; level 2 (basic information level) cor- 
responds to between 500 and 1,000 holdings; level 3 (study 
or instructional support level) corresponds to between 200 
and 500 holdings; and level 4 (research level) corresponds to 
fewer than 200 holdings. 28 The average numbers of holdings 
of titles at these levels are 1,541, 751, 389, and 153 respec- 
tively, producing ratios of 10 : 4.9 : 2.5 : l. 29 

The cut-off point for group A (ubiquitous items) is 
near the average for the study or instructional support 
level. Because the RLG Conspectus allows for the division 
of this level into two sublevels, 3a (study or instructional 



50(2) LRTS 



From the Ubiquitous to the Nonexistent 89 



level, introductory) and 3b (study or instructional level, 
advanced), one might connect group B with collecting 
levels 3b and 4, and group A with levels 3 a and below. The 
category of ubiquitous holdings, therefore, encompasses the 
Conspectus introductory instructional or study level, the 
basic level, and the minimal level. 

One would like to know the relative proportions of 
items in WorldCat at die various collecting levels. Only 1 
item in the sample has more than 1,000 holdings and thus 
qualifies through Lesniaski's rule of thumb as minimal level. 
Six items are at the basic information level, having holdings 
between 500 and 1,000. Ten have holdings between 200 and 
500, thus qualifying for the study or instructional support 
level. Although the maximum for die research level is set 
by Lesniaski at 200, and the mean is known to be 153, the 
line dividing the research level and the comprehensive level 
(level 5 of the RLG conspectus) is not established. White 
suggests that the research level goes down to 1 holding, and 
does not define a range for Level 5 tides, as he views the 
comprehensive level as "gap-filling over Levels 1 to 4. . . . 
For libraries already at Level 4 and seeking comprehensive- 
ness, 'Level 5' titles are simply any remaining desiderata." 30 
But the present study suggests that most unique and scarce 
items in WorldCat have either been superseded by more 
recent publications or pertain to such localized interests that 
they have little research significance beyond the communi- 
ties in which they were produced. Based on an admittedly 
subjective and casual examination of bibliographic records 
from the sample, most print books in English with hold- 
ings fewer than 40 are not at the research level: 40 seems 
to be a reasonable cut-off point for such items. However, 
several other items with holding levels as low as 17 — print 
books in languages other than English, along with items in 
other document and material types — also seem to be at the 
research level. Finally, academic theses (manuscripts) at 
the doctoral level with 1 holding should by definition have 
research value, and thus count as being at the research 
level, though whether libraries would seek to collect them 
is another question. White is correct if he means that items 
widi 1 holding could be at the research level, and support 
for his view can be found in Perrault's discoveiy that 63.5 
percent of monographic titles in research libraries are 
indeed unique. 31 But a qualitative examination of my sample 
suggests that, in practical terms, only a small fraction of 
the many items with fewer than 40 holdings actually are 
at the research level. If one accepts my suggestion of 40 as 
the lower cut-off point for the research level, and accepts 
that conspectus levels can in fact be equated with ranges of 
WorldCat holding levels, then the sample shows ratios for 
proportions of items at the various levels, from the minimal 
to the research levels, as 1 : 6 : 10 : 28. If, however, one 
accepts White's notion that the research level goes down to 
1, the ratios are 1 : 6 : 10 : 478. 



Summary and Conclusion 

This study has analyzed a sample of bibliographic records 
in WorldCat to determine the proportion of items that have 
ubiquitous (widely shared) holdings versus lower levels 
of shared holdings. It left die term ubiquitous undefined, 
and worked with the heuristic categories of unique items, 
scarce items (items held by between 2 and 50 libraries), and 
non-scarce items (items held by more than 50 libraries). In 
these terms, the study finds that the large majority of items 
are either unique or scarce, with only 9.1 percent of items 
non- scarce. 

Analysis of a random sample of 500 records in 
WorldCat shows that 2.2 percent of the sample is empty, 
matching no record or a record with no attached holdings. 
At the other end of the spectrum, the same percentage of 
the sample is matched by what may be called ubiquitous 
items. This uppermost fraction is obtained by taking the 
square root of the entire sample (22 for a sample of 500), 
using a formula devised by the information scientist Derek 
de Solla Price, to divide the sample into zones to account 
for items held by many libraries. 32 The highest zone, using 
this formula, consists of items with more dian 100 hold- 
ings. Dividing this zone into halves results in a subzone 
of ubiquitous items consisting of items held by more than 
380 libraries (out of more than 9,000 member libraries 
contributing to WorldCat). The items in this category con- 
sist of print-format books in English, all published in die 
United States in the last half century, mainly by giant mass- 
market publishing houses. Virtually all are cataloged and 
transcribed at die national level. Following a rule of thumb 
stating that the more ubiquitous an item is, the more basic 
it is as an educational or informational resource, the dividing 
line between ubiquitous and nonubiquitous items corre- 
sponds approximately to the line between introductory and 
advanced levels of instructional support, by the standards of 
the RLG Conspectus. Only .02 percent of items are at the 
minimal Conspectus level. 

The distribution of items in the sample shows an 
inverse geometric relationship between the number of 
items at a given level of holdings and the number of libraries 
holding items, with the largest fraction (39.8 percent) held 
by just one library. Proportions of items with higher holdings 
can be described by Lotka's law, using die inverse square of 
the items widi one holding. 33 While the fraction of unique 
items seems high, it is substantially lower dian was found in 
a recent and much larger study. 34 Moreover, 20.9 percent of 
unique titles are actually archival materials (mainly manu- 
scripts); bracketing this portion of the sample, only 34.6 
percent of remaining items are unique. 

This paper presents a categorical scheme accounting 
for the distribution of items with various levels of holding 
by member libraries — unique, scarce, and non-scarce — 
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along with a system of zones accounting for levels of non- 
scarce and ubiquitous items. Used with other quantitative 
and evaluative measures, these categories and levels are 
helpful in assessing individual library collections and online 
union catalogs. 

References 

1. Judith J. Senkevitch and James H. Sweetland, "Evaluating 
Public Library Adult Fiction: Can We Define a Core Collec- 
tion?" RQ 36, no. 1 (Fall 1996): 103-17; Judith Serebnick, 
"Selection and Holdings of Small Publishers' Books in OCLC 
Libraries: A Study of the Influence of Reviews, Publishers, 
and Vendors," Library Quarterly 62, no. 3 (1992): 259-94; 
Debora Shaw, "An Analysis of the Relationship between Book 
Reviews and Fiction Holdings in OCLC," Library ir Infor- 
mation Science Research 13, no. 2 (1991): 147-54; Danny 
P. Wallace and Bert R. Boyce, "Holdings As a Measure of 
Journal Value," Library & Information Science Research 11 
(Jan. 1989): 59-71; Stephen E. Wiberley Jr., "The Humani- 
ties: Who Won the '90s in Scholarly Book Publishing," portal- 
Libraries and the Academy 2, no. 3 (July 2002): 357-74; 
Stephen E. Wiberley Jr., "The Social Sciences: Who Won 
the '90s in Scholarly Book Publishing," College <Lr Research 
Libraries 65, no. 6 (Nov. 2004): 505-23. 

2. Allan N. Mirwis, Subject Encyclopedias: User Guide, 
Review Citations, and Keyword Index (Phoenix, Ariz.: Oryx 
Pr., 1999). 

3. Howard D. White, Brief Tests of Collection Strength: A 
Methodology for All Types of Libraries, Contributions in 
Librarianship and Information Science, no. 88 (Westport, 
Conn.: Greenwood, 1995); Anna Perrault, Global Collective 
Resources: A Study of Monographic Bibliographic Records in 
WorldCat (report of a study conducted under the Auspices 
of an OCLC/ALISE Research Grant, July 2002), www.oclc. 
org/research/grants/reports/perrault/intropdf (accessed Oct. 
6, 2004). 

4. White, Brief Tests of Collection Strength, 135-36. 

5. Perrault, Global Collective Resources. 

6. Ibid. 

7. OCLC Online Computer Library Center, "WorldCat," 
www.oclc.org/worldcat (accessed Aug. 11, 2004). 

8. OCLC Online Computer Library Center, "Watch WorldCat 
Grow," www.oclc.org/worldcat/grow.htm (accessed Aug. 11, 
2004). 

9. OCLC Online Computer Libraiy Center, "WorldCat Gold 
Records," www.oclc.org/worldcat/goldrecords.htm (accessed 
Aug. 11, 2004). 

10. Thompson M. Little, "OCLC's International Initiatives 
and the Online Union Catalog," Cataloging ir Classification 
Quarterly 8, no. 3/4 (1988): 67-78. 

11. OCLC Online Computer Library Center, "OCLC Annual 
Report 2002-2003: Year in Review," www.oclc.org/news/ 
publications/annualreports/2003/yearinreview.htm (accessed 
Aug. 16, 2004). 



12. Ibid. 

13. Random.org — True Random Number Service, www. 
random.org (accessed Mar. 10, 2005). 

14. OCLC Online Computer Libraiy Center, "WorldCat." 

15. Anglo-American Cataloguing Rules, 2nd ed., 2002 rev. 
(Ottawa: Canadian Library Association; London: Chartered 
Institute of Library and Information Professionals; Chicago: 
ALA, 2002), v. 

16. See J. W. D. Dougherty, "Salience and Relativity in Clas- 
sification," in Language, Culture, and Cognition: Anthro- 
pological Perspectives, ed. Ronald W. Casson (New York: 
Macmillan, 1981), 163-80; George Lakoff, Women, Fire, and 
Dangerous Things: What Categories Reveal about the Mind 
(Chicago: Univ. of Chicago Pr., 1987); David B. Kronenfeld, 
Plastic Glasses and Church Fathers: Semantic Extension 
from the Ethnoscience Tradition (New York: Oxford Univ. 
Pr., 1996); Anna Wierzbicka, Lexicography and Conceptual 
Analysis (Ann Arbor, Mich.: Karoma Pub., 1985). 

17. Perrault, Global Collective Resources. 

18. Ibid., table 4-la. 

19. Jay Jordan, "Global Networking of Information — OCLC's 
Strategy for the Future" (paper presented at 7th International 
Bielefeld Conference, Feb. 3-5, 2004, Bielefeld, Germany), 
http://conference.ub.uni-bielefeld.de/proceedings/jordan.pdf 
(accessed Oct. 12, 2004). 

20. OCLC Online Computer Library Center, "WorldCat Facts 
and Statistics , " www. oclc . org/worldcat/statistics/ default .htm 
(accessed Sept. 1, 2004). 

21. Conception S. Wilson, "Informetrics," Annual Review of 
Information Science and Technology (ARIST) 34 (1999): 173; 
Alfred J. Lotka, "The Frequency Distribution of Scientific 
Productivity," Journal of the Washington Academy of Sci- 
ences 16, no. 12 (1926): 317-23. 

22. Dietmar Wolfram, Applied Informetrics for Information 
Retrieval Research, New Directions in Information Manage- 
ment, no. 36 (Westport, Conn.: Libraries Unlimited, 2003), 
46; Derek J. de Solla Price, Little Science, Big Science (New 
York: Columbia Univ. Pr., 1963). 

23. White, Brief Tests of Collection Strength, 135-36. 

24. Ibid. 

25. Anthony W. Ferguson, Joan Grant, and Joel S. Rutstein, 
"The RLG Conspectus: Its Uses and Benefits," College & 
Research Libraries 49, no. 3 (1988): 197-206. 

26. White, Brief Tests of Collection Strength, 127. 

27. David Lesniaski, "Evaluating Collections: A Discussion of 
Brief Tests of Collection Strength," College 6- Undergradu- 
ate Libraries 11, no. 1 (2004): 11-23. 

28. Ibid, 17. 

29. Ibid. 

30. White, Brief Tests of Collection Strength, 129. 

31. Perrault, Global Collective Resources. 

32. Price, Little Science, Big Science. 

33. Lotka, "The Frequency Distribution of Scientific 
Productivity." 

34. Perrault, Global Collective Resources. 



50(2) LRTS 



91 



Perpetual Access to 
Electronic Journals 

A Survey of One Academic 
Research Library's Licenses 

Jim Stemper and Susan Barribeau 

A perpetual access right to an electronic journal, defined as the right to perma- 
nently access licensed materials paid for during the pe riod of a license agreement 
(not to he confused with the right to copy journal content solely for preservation 
purposes), is a concern of increasing importance to librarians as academic librar- 
ies discontinue paper subscriptions and retain electronic-only access. This paper 
explores the current environment for perpetual access to electronic journals. The 
authors report on analysis of the contracts between a large, research-level univer- 
sity library and 40 publishers of electronic journals, as well as ten large electronic 
journal aggregators. The authors seek to determine the frequency of contractual 
provisions for permanent access rights for the years of active subscription in the 
event an electronic journal contract is terminated for any cause other than breach 
by the licensee. Costs and formats of any granted perpetual access are considered. 
The paper concludes with an exploration of the potential impact of the perpetual 
access clauses libraries are accepting in licenses, the possible lack of continuing 
access, and options for addressing the situation. 

Many, if not most, academic research libraries are engaged in canceling 
print journals for various reasons, including saving money and coping 
with escalating journal subscription prices, providing more immediate access to 
journal content, and alleviating shelving space problems. 1 This cancellation raises 
inevitable questions for the library: whether the print journal's online counterpart 
also will be canceled in the future to save money and whether ongoing access to 
subscribed content can be guaranteed. 
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Statement of the Problem 



Faculty View 



Faculty are taking note of the issue of long-term journal access. In a survey of 
7,400 American university faculty members conducted in fall 2003 by the non- 
profit Ithaka, "three-fourths said a journal should ensure that its archives will be 
preserved indefinitely" and "Eighty-four percent of the survey respondents said 
that archiving of electronic resources was very important to them." 2 This finding 
suggests that faculty do care about the long-term availability of electronic jour- 
nals (e-journals). The survey also echoes the findings of a study commissioned 
by the Journal Storage (JSTOR) Project in 2000, in which more than 4,000 
faculty members at American universities were anonymously surveyed about 
their use of electronic resources. 3 Seventy-six percent of the respondents agreed 
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that it is important "that electronic journals be preserved 
for the future." 4 

Research Library Mission 

Given the traditional stewardship mission of research librar- 
ies, one can understand why university faculty would look 
to librarians to preserve access to subscribed journals. In 
their collection development textbook Building Library 
Collections, Curley and Broderick emphasize the research 
library's stewardship role. 5 Several library authors have 
urged uhe profession to maintain this stewardship role in 
the Internet age. 8 

The Digital Library Federation (DLF) Electronic 
Resource Management Initiative (ERMI) defines a per- 
petual access right as "the right to permanently access the 
licensed materials paid for during the period of the license 
agreement"; this right is different than an archiving right, 
which is defined as "the right to permanently retain an 
electronic copy of the licensed materials."' The emphasis of 
the first term is on retaining access, not on how such access 
is achieved (through die publisher's site, a locally retained 
copy, or a third-party site). A literature review shows that 
people define e-journal archiving differently. Librarians and 
publishers frequently use the term archival access when 
often meaning perpetual access. The focus of this paper is 
not on preservation of subscribed e-journals per se, but on 
contractually assuring ongoing access to them. The chal- 
lenge of ensuring continued access to subscribed informa- 
tion resources for die future is at least partly in the hands 
of research-level academic institutions; therefore, research 
libraries should address the issue, make their policies clear 
for all to see, and hold to them. 

Previous License Reviews 

In 2001, Millett conducted a review of 61 publisher licenses 
for electronic content to ascertain whether continued access 
was addressed. 8 If available, the exact license wording was 
quoted, labeled "Yes" if perpetual access upon termination 
was granted and "No" if not; any explicit mention of finan- 
cial costs was noted as well. Out of 44 licenses with a "Yes" 
or "No" label (no reason was noted for the blanks on the 
other publishers), 22 were considered a "Yes" (half of the 
reviewed licenses) and 9 of those carried a charge (fewer 
than half of the "Yes" licenses). 9 Hughes analyzed the Web 
sites of 15 e-journal providers and found "no agreement 
on how to provide access to journals after a subscription 
is lapsed," though she predicted that "some sort of extra 
access fee so that lapsed users can access the volumes for 
which they had subscriptions may become the norm." 10 
The authors of this paper sought to update and expand this 
research by determining whether e-journal providers were 



willing to add perpetual access rights to an existing license if 
not already present, and to examine how issues of costs and 
format were addressed, if at all, in the license itself. 

Print Retention Projects 

Various consortia and national library organizations are 
pursuing print retention projects to ensure that someone 
somewhere still has a complete run of a journal's print 
copy, available through interlibrary loan (ILL) for consor- 
tia! members, should a library's licensed copy no longer 
be available. The Committee on Institutional Cooperation 
(CIC) retains print copies of Academic Press, Springer, and 
Wiley titles in at least one member library; the University of 
California (UC) system stores print copies of Association for 
Computing Machinery (ACM) and Elsevier Science titles 
in a centralized, shared archive. 11 To identify which print 
archive holds the last copies of specific titles, the Center 
for Research Libraries is developing a national registry 
system. 12 

McDonald cited die important role that consortial 
membership plays in retention decisions as such, "Many 
libraries consider themselves or are considered by a consor- 
tia or regional group to be the archive of record for all or a 
portion of their collective print collection. It will continue 
to be their responsibility to maintain print issues for those 
items that are their archival responsibility." 13 However, there 
are several drawbacks to print archives that must be consid- 
ered. Due to print subscription costs as well as attendant 
hosting, processing, and tracking commitments, print reten- 
tion projects may entail a significant monetary investment. 
Also, the Library Journal Academic Newswire paraphrased 
one participant (Baker, chair of the CIC Library Directors 
Group) as cautioning that "at some point, major publish- 
ers might choose to abandon print altogether, posing yet 
another challenge." 14 Such print retention projects are in 
their infancy, and the academic library community does not 
yet know whether document delivery from such reposito- 
ries (perhaps delayed, perhaps completely unavailable due 
to copyright clearance requirements) will be acceptable to 
users. Libraries determined to go e-only to save money may 
wish to allocate money for print retention projects and seek 
membership in a consortia that has such projects. 

E-journal Archiving Projects 

Simultaneous with these efforts to preserve a print journal 
archive, various projects have studied e-journal archive 
creation and maintenance. Extensive research efforts have 
assessed the scope of the e-journal archiving issue and 
proposed strategies for preservation of and long-term 
access to full-text content. 15 The Task Force on Archiving of 
Digital Information, under the auspices of the Commission 
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on Preservation and Access and the Research Libraries 
Group, issued an influential 1996 report. 16 Based on the 
findings of the task force's report, the commission's succes- 
sor organization, the Council on Library and Information 
Resources (CLIR), along with DLF and the Coalition for 
Networked Information (CNI), issued "Minimum Criteria 
for an Archival Repository of Digital Scholarly Journals," 
the sixth of which has special implications for licensing 
librarians: "A repositoiy will make preserved information 
available to libraries, under conditions negotiated with the 
publisher." 1 ' What type of access a lapsed subscriber might 
have at such an archive and what license wording is specific 
enough to ensure such access are the critical issues. The 
authors' analysis of one institution's licenses explores how 
these access issues manifest themselves in contracts. 

As part of its Archiving Electronic Journals project, 
the Mellon Foundation funded seven year-long studies that 
aimed to meet these repositoiy criteria, six of which involved 
large academic research libraries working in tandem with 
major journal publishers. 18 Most projects addressed issues 
related to technological infrastructure and workflow, as well 
as sustainable business and access models. Except for the 
publishers who worked with Stanford on the Lots Of Copies 
Keeps Stuff Safe (LOCKSS) project, none seemed open to 
the idea of access to die archive by lapsed subscribers. Users 
could have access to e-journal archives only under certain 
conditions usually called trigger events; conditions discussed 
between Harvard and Blackwell Publishing, the University 
of Chicago Press, and John Wiley and Sons typify diose cited 
in the project reports: 

1. When material is no longer accessible online from the 
publisher. 

2. When the publisher sells or otherwise transfers the 
rights to publish a given title to another body. 

3. When the material has been in the archive for "n" years 
("n" being a time period to be agreed to by Harvard 
and the publisher on a title-by-title basis). 

4. When the title ceases to be published. 

5. When the content enters the public domain. 19 

Elsevier stipulated that its prototype archive at Yale 
University Library could not compete with the publisher's 
revenue stream. 20 This archive's trigger events were corre- 
spondingly limited: users not affiliated with Yale could only 
access the archive on-site at Yale or in the event of a natural 
disaster, as ILL was not part of the archive's mission. 21 To a 
faculty member of an American university whose library was 
forced to cancel a needed ScienceDirect title for budgetary 
reasons, the prospect of making a trip to Yale University 
Library to photocopy an article is not an attractive or likely 
option. More practical alternatives for research libraries 
are needed. The University of Pennsylvania recommended, 



among other things, that further research in this area "deter- 
mine the relationship between events that would 'trigger' 
nonsubscriber access rights versus paid subscriptions." 22 
Short of each library setting up its own archive of a publish- 
er's journals, and in the absence of traditional ILL service, 
how could a lapsed subscriber obtain access to content for 
which the library had once paid without adversely affecting 
the publisher's bottom line? 

The Yale/Elsevier e-journal archiving project, while 
drawing no final conclusions about economic models pend- 
ing further study, identified preservation metadata as one 
way to add value to journal content, thus justifying some 
type of setup or maintenance fee. 23 The project report also 
observed that "the ongoing [archiving] costs, at least for 
standard publisher's journals, could be relatively predictable 
and eventually stable over time." 24 This outlook of feasibil- 
ity and stability is at least partially good news for library 
administrators. 

After sifting through the lessons learned from the 
Archiving Electronic Journals projects, the two models the 
Mellon Foundation felt worthy of continued funding were 
LOCKSS and Ithaka's Electronic Archiving Initiative. The 
key issues that led to this decision to fund two different 
archiving options revolved around cost and control — no one 
project's means of financing an archive was identified as 
being the clear - solution, and neither publishers nor libraries 
were willing to cede control of an archive to the other. 25 

The innovation of the LOCKSS model is its decentral- 
ized nature — allowing libraries to create locally cached cop- 
ies of subscribed e-journal content that they could deliver 
to authorized subscribers as needed with die sanction of 
participating publishers, with other institutions' peer-to- 
peer network of caches acting as backups. 26 With LOCKSS 
staff negotiating publisher participation, all the individual 
library needs to do is to get die publisher's blanket permis- 
sion to cache the title, which LOCKSS's already-negotiated 
license wording facilitates. 2 ' The LOCKSS site claims that 
"more than 80 libraries and 50 publishers . . . are using the 
software." 28 Like many projects, LOCKSS now needs secure 
funding to remain viable, so the new LOCKSS Alliance is 
seeking participation from new libraries who would pay 
annual membership fees. 29 

The second archiving model that Mellon is funding 
is Portico, a recently launched project of the nonprofit 
organization Ithaka (itself an outgrowth of JSTOR, the 
respected e-journal back-file archiving service). 3 " Also called 
E-Archive, Portico argues for a centralized archive of elec- 
tronic scholarly literature, with distributed costs shared by 
participating publishers and libraries. Portico justifies the 
centralized approach on the grounds that most publishers 
will not see a long-term business interest in maintaining 
and migrating e-journal data as the content gets older, and 
strained library acquisition budgets will not be able to cover 
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such costs. 31 Portico's proposed funding model is similar to 
JSTOR's; participating libraries are asked to pay a one-time 
archive development fee and an annual archive support fee. 
Participating publishers are asked to pay a one-time setup 
fee and then an annual deposit fee for each journal. 32 The 
funding model thus reflects the findings of Mellon journal 
archiving projects that journal archives have both initial 
setup and ongoing maintenance costs. The Portico site lacks 
specifics about the conditions under which one can obtain 
access to content. The mission page's statement that "access 
to the contents of the archive will be available under limited 
circumstances" may be cause for concern. 33 Exactly how 
limited are the circumstances? The JSTOR moving wall 
of content is mentioned on the Portico site, which appar- 
ently means that a library that has canceled a title may have 
to wait a publisher-specified number of years before the 
once-subscribed issues become available through Portico. 34 
What remains to be seen is whether most or all of a library's 
desired savings from going e-only is simply transferred to 
such an archiving service, and whether the limited years of 
Portico access are of sufficient value to users. 

If a critical mass of publishers joins the Portico project, 
the approach may offer a compelling alternative to LOCKSS. 
Publishers would not have to worry about losing control of 
their content in distributed LOCKSS caches; libraries would 
not have to setup and maintain local LOCKSS caches. 
However, LOCKSS co-founder Reich maintains that "only 
the largest publishers have sufficient resource to implement 
(or negotiate with third parties to provide) archives for 
content they publish," which makes LOCKSS's low cost a 
compelling option for smaller publishers. 35 Some publishers 
may opt for only one of the LOCKSS or Portico approaches 
(or neither), meaning that libraries might well end up with 
a hybrid approach to perpetual access, with implications for 
budgeting and presenting a coherent user interface: 

• local or regional print archive (when the publisher 
offers no perpetual access rights); 

• local data loading (e.g., LOCKSS cache); 

• access through a central subscription archive (e.g., 
Portico); or 

• access through the publisher's server, which may or 
may not entail a maintenance fee. 

Individual Library Actions 

Many individual university libraries and consortia are 
attempting to address this problem systematically through 
collection development policies governing the move to elec- 
tronic-only collections. 36 Whether labeled criteria, checklist, 
model license, or principles, these policies show that librar- 
ies need guidelines so they can safely make the move to 
e-only journals without jeopardizing long-term access. Such 



guidelines typically state the need for an explicit guarantee 
of perpetual access rights in the license should the library 
cancel a title in print or electronic format. Potential costs 
and hosting options for continued access also are frequently 
mentioned. Both the University of California (UC) and the 
United Kingdom's National Electronic Journals Licensing 
Initiative (NESLi2) for higher-education institutions state 
that perpetual access must be free, while the Canadian 
National Site Licensing Project states that members will 
have to pay a fee to the publisher. UC and NESLi2 also state 
that three hosting options are acceptable: access through 
the publisher's site, the library's site, or some type of central 
archive hosted by a third party. 3 ' Policies from several librar- 
ies and consortia stipulate that perpetual access should be 
guaranteed even when a journal title ceases publication or 
is transferred to another publisher. 38 

The University of Maryland's policy document provide 
a useful example in terms of detail and context. The criteria 
provide three scenarios under which one might consider 
going e-only (no publisher commitment, an expressed com- 
mitment from the publisher, and actual planning by the 
publisher for perpetual access), giving the selector progres- 
sively more leeway to go e-only as the publisher's commit- 
ment and the core nature of the title allow. Such tiered 
decision-making acknowledges that no single answer fits all 
situations, even within one library system. 39 

Guidelines are only as effective as the enforcement 
behind them. Are they ideals for which to strive, or required 
elements of any license? Some libraries have demonstrated 
the willingness to apply the guidelines in practice. The 
California Digital Library (CDL) publicly rejected the 
Nature Publishing Group's 2005 renewal terms because of 
the lack of guaranteed perpetual access and their belief that 
"Nature may change hands, or Nature may be archived by 
a third party, and UC does not expect to pay repeatedly for 
the same content." 40 Because consortia represent so many 
libraries and so much subscription income for publishers, 
they are well-positioned to exert market power and apply 
criteria for the granting of perpetual access rights. 

Surveys of Current Practice 

Anecdotal evidence from postings to the Electronic 
Resources in Libraries (ERIL-L) and Liblicense electronic 
discussion lists indicate that librarians are not taking pub- 
lishers up on their offer of LOCKSS access. 41 Shreeves, col- 
lection development officer (CDO) and associate director of 
the University of Iowa Libraries, said: 

I suspect that the "perpetual access" and ownership 
rights that publishers like Elsevier often include 
in their contracts are in some measure convenient 
fictions we use to convince ourselves that the old 
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ownership, or stewardship, model still works in 
the digital age. . . . But in practice die tremendous 
uncertainties of digital preservation thwart our 
ability to make useful distinctions between nomi- 
nal ownership and leasing of information. This is 
not to minimize the fundamental importance of 
this distinction in principle — I assume we all try 
to negotiate permanent or archival access to any 
product we license. 42 

The authors tested the latter assumption in August 2004 by 
posting a brief survey to the Chief CDO's electronic discus- 
sion list, which includes the CDOs of the largest Association 
of Research Libraries (ARL) member institutions. The 
authors' institutions were not included in the survey, nor 
were national and public libraries, which left a suivey 
population of 38 potential respondents out of 47 members. 
CDOs at 25 institutions responded (for a response rate of 
65.8 percent, or two-thirds of the authors' peers). Members 
were asked whether, during e-journal license negotiation, 
they ask for perpetual access rights to subscribed years 
upon cancellation, and whedier they consider the lack of 
perpetual access to be a deal breaker. While this was not 
a scientific sample, the results were still informative and 
clear — a majority of respondents (76 percent) consistently 
ask journal publishers for perpetual access, yet a majority 
(also 76 percent) would still sign an e-journal license without 
assurances of such access. 

An ARL suivey on e-journal licensing, conducted die 
previous year by Case, was published in die same month the 
authors' survey went out to the Chief CDOs list and found 
similar results. 43 The ARL survey documented a gradual but 
definite move towards e-only access. In 2002, only a few 
of die 40 respondents were going e-only; in 2003, 25 of 57 
respondents (44 percent) were doing so for new packages, 
and "another six [respondents] indicated . . . they seemed to 
be moving in that direction." 44 Further, a minority of ARL 
institutions view "Archival Access/Perpetual Rights/'Own' 
the Content" as a deal breaker in license negotiations (14.6 
percent of the respondents for ARL, where die Chief CDOs 
survey found 24 percent). 4 Put anodier way, using Case's 
survey, the remaining 85.4 percent of ARL institutions evi- 
dently do not view lack of perpetual access as a contractual 
deal breaker; most likely, many of this clear majority of the 
respondent institutions were among the 44 percent who 
were going e-only for new packages. One must take note 
that so many research level libraries are not assigning greater 
weight to perpetual access guarantees when making the big 
move to e-only collections. Taken togedier, both surveys sug- 
gest that fewer than 1 in 5 large research libraries say no to a 
license that does not provide perpetual access assurances. 

A subsequent suivey sent to the e-journals discussion 
list, to which primarily United Kingdom academic librar- 



ies responded, echoes the previous findings. 46 A substantial 
number of the institutions (80 percent) had "already moved 
to e-only for at least part of their collections," but a lack of 
prior planning was occasionally reported: "Sometimes this 
[decision] is underpinned by a strategy or policy, but more 
often it seems to [be] happening as a reaction to other pres- 
sures," such as "shrinking bookfunds and lack of space." 4 ' 

With the large numbers of libraries going e-only, as 
demonstrated by these surveys, one cannot realistically 
expect many libraries to rely on a print copy as the pri- 
mary means of perpetual access. For example, Bracke and 
Martin maintained that, for the University of Arizona's 
Science-Engineering Library "shrinking budgets and space 
constraints make print retention an untenable and non- 
strategic plan of action." 48 At the University of Minnesota 
and University of Wisconsin, the move to e-only journal 
access is encouraged for the same reasons. In cases where 
a specific license does not permit perpetual access rights, 
retaining a print subscription for specific titles may be an 
affordable option, especially if the library belongs to a con- 
sortia that participates in a print retention project. 

Access through the Publisher 
When the Journal Changes Hands 

The transfer of journal title ownership from one publisher 
to another is an increasing problem. McDonald defined the 
problem thus: 

Libraries must have a secure sense that the pub- 
lisher or the project is reputable and likely to retain 
their electronic publishing program at a reasonable 
cost in future years. The purchase of Academic 
Press by Elsevier has laid a heavy economic bur- 
den on many libraries and future purchases of 
commercial publishers by odiers can wreak havoc 
on the information resources for any library. . . . 
Libraries that long ago canceled print issues of 
journals from Academic Press in favor of electron- 
ic only access [through the International Digital 
Electronic Access Library (IDEAL) service] are 
now faced with huge cost increases when renewing 
these licenses through Elsevier. 49 

Eells noted that Elsevier did not honor some per- 
petual access commitments for former IDEAL customers, 
meaning that customers essentially had to pay for access 
to some years twice (once through IDEAL, then through 
ScienceDirect). 50 Bird and Waller examined the licenses 
for 6 publishers that offered what the authors termed "big 
deals" (Academic Press, American Chemical Society [ACS], 
Elsevier, Institute of Physics, Royal Society of Chemistry, 
and Springer), with particular attention to perpetual access 
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clauses for journals that these publishers sold and that were 
no longer part of the current package. All of the surveyed 
publishers except Springer offered perpetual access rights 
to the Canadian Research Knowledge Network as part of 
the license and delivered ongoing access upon the sale of 
ten randomly chosen titles. 51 

Access through Local Data Loading 

During the last decade, academic libraries have been mov- 
ing from locally loaded databases in favor of Web access 
through the publisher's site. The primary reason is die high 
infrastructure cost for librarians to network content stored 
on disc, but the user preference for the convenience of the 
Web and the potential lack of enhanced functionality, such 
as cross-links are factors as well. 52 LOCKSS addresses many 
of these needs, but the lack of widespread adoption by 
librarians and publishers merits the consideration of other 
options. While the OhioLINK consortium has been able to 
secure perpetual access rights to locally loaded journal con- 
tent in licenses from all publishers except ACS, Diedrichs 
noted that "the investment required in the archiving process 
is considerable and the question of who best should perform 
that function — the library, the publisher, some middle- 
man — is yet to be resolved for most libraries." 3 

Grudzien, head of collection development at Central 
Michigan University Libraries, illustrated that even if one 
is able to obtain perpetual access assurances, the option 
of local data loading is not to be taken lightly, citing late 
delivery of Elsevier journal content, and subsequently the 
need to both locate outdated tape drives for accessing the 
content stored on digital linear tape and to organize the 
content itself. 54 The fact that only 10 Elsevier customers 
worldwide have opted for the locally loaded version of 
ScienceDirect may be an indication of the preference for 
accessing e-journals at the publishers site and thus avoid- 
ing the attendant infrastructure and staffing costs of local 
data loading. 5 

Access through Third Party Solutions 

The e-Depot at the Koninklijke Bibliotheek (the National 
Library of the Netherlands) acts as an archive for 
BioMedCentral, Blackwell Publishing, Elsevier Science, 
Oxford University Press (OUP), Sage, Springer, and Taylor 
and Francis journals. 58 However, for institutions outside the 
Netherlands who cancel any subscriptions to these journals, 
e-Depot is only an archive of last resort. The head of e- 
Depot has said that these depositing publishers allow three 
limited types of access: 

• on-site access for registered users; 

• ILL supply in the Netherlands; and 



• access for any licensee in case publishers cannot meet 
obligations (calamities, bankruptcy). 

Subscribers of major e-journal packages outside the 
Netherlands cannot cancel titles and expect to get free 
ILLs of articles for their users; they will need to make other 
arrangements. Still, the experiment may show promise as a 
model for other countries, such as the United States (per- 
haps in conjunction with consortial site-licensing projects at 
a regional or national level), and bears watching. 

A research library's mandate to provide current access 
to journals for today's scholar's can be at odds with the man- 
date to keep those journals available to be accessed by schol- 
ars in the future. Librarians still value uheir stewardship role 
in the digital realm, but they perhaps fear that pressing the 
issue contractually is commercially and financially unreal- 
istic at this time. If the level of public support for higher 
education continues to erode, acquisition budgets continue 
to dwindle accordingly, and research libraries continue to go 
e-only, this practice will have major implications for the type 
of library collection scholars will see in the future. Research 
libraries will have to grapple with the way they now interpret 
their traditional stewardship role, and come to grips with the 
issue of perpetual access to subscribed e-journals. 

Objectives and Methods 

The goal of this paper is to determine the frequency of con- 
tractual provisions for perpetual access rights for the years 
of active subscription in the event an e-journal contract is 
terminated for any cause other than breach by the licensee. 
If a library asks for perpetual access rights in a license, 
how often will such access provided? Has the access-over- 
ownership model triumphed, or can libraries promise users 
that the electronic content for which the libraries have 
paid will still be available in the future? The authors' work- 
ing hypothesis was that the majority of e-journal provider 
licenses do not provide explicit allowance for perpetual 
access rights. 

Toward this end, the authors analyzed a representa- 
tive set of journal publisher and aggregator licenses signed 
by the University of Minnesota as of the start of the fall 
2005 semester (40, or about one-third, of the library's 
journal publishers), looking for perpetual access clauses. 
September 1 is the usual time the university's journal can- 
cellation decisions are due to its serials agents. Separately, 
the authors analyzed the licenses for 10 major e-journal 
aggregators (defined as "a bibliographic service that pro- 
vides online access to the digital full-text of periodicals 
published by different publishers"). 58 A smaller number of 
aggregators was analyzed because there are not as many in 
the marketplace from which to choose, and a few of these 



50(2) LRTS 



Perpetual Access to Electronic Journals 97 



aggregators control a significant market share. Aggregators 
were included because selectors can be tempted to cancel 
print versions of journals covered in aggregated databases 
just as they would be for an individual journal publisher; 
aggregators were analyzed separately to see if they had 
any unique issues or findings. Thus a total of 50 e-journal 
licenses were analyzed. 

There were three criteria for selecting the journal pub- 
lisher set and the aggregator set: 

• a mix of commercial and society or university press 
publishers; 

• a mix of large and small publishers; and 

• journal collections likely to be held by large libraries. 

By examining collections likely to be subscribed to by 
most research libraries, the authors hoped to get a better 
sense of the implications for lost national access should 
libraries cancel the paper versions of diese journals with 
no print backup or ensured contractual access upon ter- 
mination. By examining equal numbers of commercial and 
society publishers, the authors hoped to ascertain whether 
one type of publisher was more or less likely to grant such 
access. The providers surveyed are listed in the appendix. 

For each e-journal collection from these entities, the 
authors looked at the library's existing license and asked the 
following questions: 

• In the event that the library terminates the license, is 
the publisher or aggregator willing to provide some 
form of perpetual access to the years licensed during 
the subscription? 

• If the publisher or aggregator grants perpetual access 
rights, does the license specify an associated cost? 

• If the publisher or aggregator grants perpetual access 
rights, does the license specify a format or access 
mode (e.g., continuing access through the provider's 
Web site, local data loading through a CD- or DVD- 
BOM or a LOCKSS cache, or access through a third 
party, such as Portico)? 

At die University of Minnesota-Twin Cities and the 
University of Wisconsin-Madison, die practice is to ask for a 
contractual guarantee of perpetual access rights when licens- 
ing electronic access to journals. In some cases, the publisher 
or aggregator generic license did not offer such access, and 
the library had to ask for the right during the license nego- 
tiation process. Sometimes the library had to ask a publisher 
representative for clarification when the license wording 
did not clearly answer die perpetual access question. As any 
licensing librarian knows, this negotiation process is central 
to contract work. The University of Minnesota adapted 
language from a generic academic library single-institution 
license template recommended by ABL: "On termination 



of this License, other dian for cause, the Publisher shall 
provide continuing access for Authorized Users to that 
part of the Licensed Material that was published within 
the Subscription Period." 59 Note that this language does 
not mention costs. Also, wording in the template related to 
choice of format ("either from die Server, or by supplying 
{electronic files) {CD-BOMs} {microfiche} to die Licensee") 
was deleted, as the University of Minnesota did not want to 
invite the prospect of locally loading journal content. 60 

Each final, signed license was examined for any clause 
related to perpetual access rights. If a provider's generic 
contract explicitly granted a perpetual access right (as earlier 
defined by DLF's EBMI), or if the university was successful 
in getting die provider to include this right during nego- 
tiations, this was counted as a "Yes." Often, if included, die 
clause was under such a heading as "Termination," "Usage 
Bights of Lapsed Subscribers," or "Perpetual License." The 
relevant clause was copied and pasted into a master docu- 
ment so all provider wording could be compared side by 
side. Decisions on availability of access rights for each pro- 
vider were recorded in an Excel spreadsheet, as were any 
contractual mentions of cost or access format. 

If a license was silent on the issue of perpetual access 
rights, this was counted as a "No." The authors followed the 
ABL's Strategic and Practical Considerations for Signing 
Electronic Information Delivery Agreements, which advise 
the licensing librarian to get desired rights in writing: "If 
the language of a contract leaves you feeling unsure how 
users may use the materials, clarify the contract by writing 
in what you need." 61 Because every license has a specified 
term or period of performance, once that time has past, 
a university has no rights that the agreement does not 
expressly grant. This is why one sometimes sees contractual 
provisions stating that the rights and obligations of section 
X shall not be terminated upon the conclusion of the term 
of the agreement. So, if the license renews annually and a 
university wishes to have access to the material after the 
term's end date, then the license needs to include a provi- 
sion granting the continued right of access. While a library 
could attempt to renegotiate an existing license that does 
not explicitly grant perpetual access rights in the future, 
the authors were only interested in the licenses that grant 
such access at the present time. This paper thus affords a 
glimpse of what the University of Minnesota would have 
perpetual access rights to the day after the library canceled 
its e-journal subscriptions with the surveyed publishers 
(through its serials agents or directly), barring any later 
renegotiation with these publishers. 

Results 

Overall, most publishers — by almost 2 to 1 (64 percent to 36 
percent) — in the sample grant perpetual access (see table 1). 
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This initial finding makes the prospect of continued access 
sound realistic if librarians ask for the right and if librar- 
ians vote with their dollars. For years, librarians have been 
discussing die pros and cons of access versus ownership of 
material; however, in the electronic content delivery climate 
of today, that unembellished model is too simple to cope 
widi the related complexities and ramifications in ongoing 
discussions of the current e-journal licensing environment. 
In the authors' perspective, perpetual access should be the 
clear objective. To that end, examining and defining how 
such access will be provided, to what content and by whom, 
and under what cost and format conditions, is necessary in 
library license negotiations. 

Within die scope of the licenses examined, more of the 
commercial publishers' agreements allow or provide for per- 
petual access rights than do those of society publishers — 72 
percent of commercial publishers and aggregators, com- 
pared to 56 percent of society publishers and aggregators. 
Almost three-quarters of commercial publishers' licenses 
provide for perpetual access, while more than half of societ- 
ies' licenses do not mention or explicitly decline to provide 
perpetual access. Libraries' past experiences with scholarly 
publishers versus commercial publishers (with regard to 
pricing issues and a general cooperative spirit) created an 
expectation that the society publisher agreements would 
naturally be more likely to provide for these access rights. 
The actual outcome was unexpected, considering die belief 
(from the perspective of the authors) that society publishers 
will be more responsive and responsible players in the schol- 
arly communication realm. The publications of scholarly 
societies are essential components of academic and research 
library collections, and these publications present a balance 
to the commercial publications also relied upon to dissemi- 
nate scholarly research. 

Some of the declining publishers give their "No" 
answer right in the contract and did not rely on silence 
alone. For example, the following clauses seem to say 
"Don't even ask": 

Walter de Gruyter does not warrant that the 
Licensed Materials will be made available 
permanently. 62 

Table 1 . Publisher's contractual position on perpetual access rights i 



In the event of termination or expiration, the 
Subscriber may not retain any portion of the 
[Association of Computing Machinery Digital 
Library]. 63 



Publisher 
type 

Commercial 

Society 

Total 



Yes 

is 

14 

32 



% yes 
within this 
% yes publisher 
overall type 



36 
28 
64 



72 
56 
n/a 



No 

7 
11 



% no 
overall 

14 

22 
36 



Some publishers with free online access with print sub- 
scription arrangements simply tell libraries that they lose 
online access when they cancel print subscriptions. More 
and more, though, these publishers are reversing die tradi- 
tional cost base, making print an add-on to electronic access, 
rather than the other way around. In this new environment, 
perpetual access rights would then have to be negotiated as 
part of a renewal. Haworth Press goes even farther, advising 
librarians not to cancel print prematurely. Because Haworth 
Press considers "the electronic/print serial scene [as] still too 
volatile and too early in an infancy stage to allow [Haworth] 
to dismantle [its] entire in-house printing operation," they 
regard the print version as the "archival back-up" to the 
electronic version. 64 In the absence of an explicit contractual 
guarantee of perpetual access rights for the online version, 
or a collaborative print retention program for this publisher, 
one should weigh the issues carefully before canceling the 
print version of a Haworth journal. 

The American Medical Association's representative, 
in an e-mail, explained her society's stance this way: "We 
do not allow perpetual access. A site license is like getting 
cable — you are paying for access to the programming, not 
the ownership of the actual programming." 63 This statement 
seems to preclude any possibility of entering into a discus- 
sion of the issues as part of a contractual negotiation. 

Some publishers go so far as to require the licensee 
to destroy all copies of downloaded articles. For example, 
one commercial publisher says, "Licensee agrees to destroy, 
and will use reasonable endeavors to instruct Authorized 
Users to destroy all Licensed Material stored on any digital 
information storage media, including, but not limited to, 
system servers, hard discs, diskettes, and back up tapes." 66 
Librarians try to negotiate the deletion or rewording of such 
clauses, due to privacy and enforceability concerns, but 
again the implication is that the library does not own this 
data and thus does not have a continuing right to access the 
data. Implicit in this clause is a major shift in the concept 

of ownership that needs to be 
n some form (n=50) examined carefully, not adopted 

out of short-term expediency. 

0/, ° n o A handful of providers note 

within this specific expiration parameters 

publisher r r i ^ 

type Total * or P er P e tual access m the 

licenses. Springer specifies two 
28 25 years, the Online Computer 

44 25 Library Center (OCLC) five 

n/a 50 years, and Karger ten years. 67 

Perpetual access may not be 
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perpetual at all; the clock may be ticking. Along with cost 
and format, this is yet another aspect that must be consid- 
ered by selectors when deciding whether to cancel a title. If 
nothing else, the selector may need to reconsider reinstating 
such a title or package, or renegotiating some form of "sub- 
scribed years" access, probably for a price. 

Separately, the authors analyzed the licenses for 10 full- 
text database providers (as earlier defined), 5 of which are 
commercial providers and 5 of which are society providers. 
Several of these 10 are often referred to as aggregators, in 
that they provide journal content from publishers, but are 
not the publishers themselves and generally do not provide 
for continued access should the subscription be terminated. 
The commercial providers in this group are from EBSCO 
Industries, Gale, H. W. Wilson, Ovid, and ProQuest; the 
society providers are BioOne, Highwire Press, JSTOR, 
OCLC Electronic Collections Online (ECO), and Project 
Muse. Few offer any contractual guarantee of perpetual 
access. Canceling a print title included in such a package 
is risky. Their title lists and coverage frequently change, 
depending on their contracts with the publishers. If a 
library's agreement is not renewed, one cannot count on any 
continued access to the journals therein. 

Cost Issues 

In a parallel finding that tempers the two-to-one result 
about perpetual access being available, the authors found 
that almost half of the providers that allow perpetual access 
specify that a charge will or may be associated with this 
access (14 of 32 granting providers, or 43.8 percent); see 
table 2. A roughly equal percentage of commercial and 
society providers who allow perpetual access charge for 
such access (44.4 percent versus 43 percent). As tire Mellon 
Foundation's journal archiving projects demonstrated, host- 
ing and maintaining content comes with an attendant cost. 
Thus one can reasonably expect a cost to be associated with 
a perpetual access service, whether handled locally, by the 
publisher, or by a third party. 

The licenses that mention cost do so in a variety of ways 
and at differing levels of specificity. In the majority of the 
licenses studied, the fee is to be 
paid to die publisher (75 per- 
cent of commercial publishers, 
100 percent of society publish- 
ers). In a handful of cases, the 
license specifies a fee be paid 
to a diird party or simply is not 
clear. The library's own setup 
costs for loading data locally 
are implied in many cases and 
are not explicitly stated. More 
study is needed in this area for 



purposes of budgeting for perpetual access. Examples of 
contract language about costs follow. 

Some publishers distinguish between content costs and 
access or setup costs. The American Institute of Physics 
(AIP) exemplifies a situation where die library might be 
assuming both types of costs: one for the publisher and 
one for library technology infrastructure investments: "The 
Subscriber will be given die option to purchase a physical 
archive copy, for example a CD-ROM. . . . Any hardware 
or software required to distribute content from the archive 
copy will be the responsibility of the Subscriber." 68 

Most licenses are not as specific about a library's inter- 
nal costs but assume that the library knows about, can cope 
with, and will absorb such hardware, software, program- 
ming, interface design, connectivity, and staffing costs. 
Similarly, the American Psychological Association (APA) dis- 
tinguishes between content fees and access fees: "Although 
[the customer] would not pay ongoing data fees, they would 
pay for the delivery of the content." 69 Either way (whether a 
library is paying die publisher or library or technology ven- 
dor support staff), sometimes multiple and ongoing costs for 
perpetual access will exist. 

Specific costs usually are not detailed in the license, 
even when the concept is mentioned and can be subject to 
change at die publisher's discretion. Typical wording is: 

There may be fees associated with these options.' 

Wiley will provide the Licensee with access to 
the full text of the Licensed Electronic Journals 
published during the Term of this License ... at a 
cost-based fee agreed by both parties. 11 

Upon termination of this Agreement, Licensee 
may retain the right to use in archived form the 
content of the Database provided that Licensee 
. . . pays all costs associated with providing the 
Database content to Licensee on a mutually agree- 
able media type . . . Licensee acknowledges that 
the terms and conditions applicable to Licensee's 
archiving rights under this paragraph including, but 
not limited to, the media type and annual fee or fee 



Table 2. Granting publishers' contractual position on cost for perpetual access rights (n=32) 
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per year of archive material may be modified by 
[Institute of Electrical and Electronics Engineers] 
at its discretion. 12 

Of the three previous examples, the likelihood of a fee 
increases with each succeeding quote. The first says there 
may be a fee; the second says that there will be a fee and 
the library has a say in what that fee might be; and the third 
says there will be a fee and the amount is open-ended. So 
the potential exists for perpetual access fees of undeter- 
mined amounts that may change from year to year. Because 
perpetual access is a relatively new and undefined service, 
this is not surprising, but libraries must be aware of the bud- 
getary implications of open-ended financial commitments. 
One could plausibly argue that the uncertainty surrounding 
perpetual access fees may not be substantially different than 
the fluctuations in journal prices; still, this fluidity will make 
budget projections even more challenging to produce than 
they are now. The license negotiation process is die library's 
opportunity to participate actively and creatively in further 
definition of cost assessment, perhaps in consultation with 
colleagues at peer institutions. 

Sometimes the cost is specified, at least with respect to 
existing subscription fees. For example, East View specifies 
a fee of 10 percent of the current annual subscription cost. 
Sometimes the cost depends on the format, and the effort 
required to convert the journals to or deliver the content in 
that format. ' 4 

In addition to LOCKSS access, the American Society 
for Microbiology (ASM) allows open access to its journals 
after a specified time, but ASM reserves the right to charge 
a fee (for current and back-file issues) in the future. This 
license serves to remind one of the unknown long-term 
potential of relatively new open access models. 

The offering of separate back-file journal sets for years 
not covered in a current subscription is increasingly com- 
mon (e.g., from ACS, Annual Reviews, Elsevier, Nature, 
Wiley). One might easily find oneself in the situation of hav- 
ing bought a journal back-file set, but being without access 
to current issues due to cancellation of a title for which 
perpetual access rights are not contractually guaranteed. In 
cases where perpetual access to a canceled journal is not 
allowed in the license, a library might opt to retain the sub- 
scription but would need to pay more to retain access to the 
back files. The American Association for the Advancement 
of Science (AAAS), publisher of the popular journal Science, 
says that "because AAAS cannot be certain of future technol- 
ogy, storage, or maintenance costs, AAAS cannot guarantee 
[back-file] access" and it "reserves the right to remove all or 
portions of the archive of past issues, or to institute a charge 
for access to it in the future."' 6 

JSTOR has been a unique scholarly project, embraced 
by many research libraries as a trusted repository of key 



scholarly journals, as evidenced by their extensive list of 
participants that have licensed their journals. The expecta- 
tion of the authors, based on JSTOR's good reputation in uhe 
library community, was that the JSTOR agreement would 
be an exemplar for perpetual access. However, the agree- 
ment at the time of this research does not allow for access 
to content after a JSTOR subscription termination. ' ' JSTOR 
charges an archive capital fee; the authors would posit that 
the use of the term capital in reference to this fee implies 
a capital investment by the institution. Learning that uhe 
archive capital fee does not entitle one to permanent access 
to the collections as defined at the time of one's subscription 
is disconcerting. JSTOR's two-fold pricing model, reflecting 
the need to recoup up front and ongoing maintenance costs, 
is a common one. The expectation is that this is a trusted 
repository that a library can depend upon for continued 
access. While libraries might be surprised to learn that their 
initial capital investment alone does not guarantee them 
perpetual access rights to JSTOR content in the future, they 
should expect this separation of setup and maintenance 
costs from other publishers. 

Format and Hosting Issues 

When one looks at the types of formats and hosts that pub- 
lishers offered for perpetual access, one finds that the option 
of continued access through the publisher's own server is 
offered just as often as the option of local data loading (i.e., 
at least one of these two options was offered in 53.1 percent 
of the studied licenses); see table 3. Also, the commercial 
publishers and society publishers studied are similar in the 
percentage that offer such access through their own server, 
with a slightly greater percentage of society publishers offer- 
ing this option (57.1 percent to 50 percent). 

Sometimes one finds a clear correlation between cur- 
rent access practice and perpetual access. Six of the 32 (18.8 
percent) granting publishers that provide Web access will 
continue that access as the sole format. However, as noted 
earlier, Springer puts a time limit on perpetual access to 
its server, after which other options must be considered: 
"Access is granted for two years after cancellation; access 
and storage options are available for subsequent years.'" 8 

By a large margin, more society publishers than com- 
mercial publishers allow the library to load data locally 
(71.4 percent to 38.9 percent), perhaps reflecting a greater 
concern among commercial publishers about retaining con- 
trol of their data. AIP allows the library to copy data from 
a physical archive copy to the library's secure network.' 9 
Project Muse also allows this practice, on DVD, though the 
non-searchable format of their content does not sound very 
useful and may well imply significant, hidden setup costs for 
the library to make the dataset searchable (such as metadata 
tagging): "Approximately 90 days after the expiration of an 



50(2) LRTS 



Perpetual Access to Electronic Journals 101 



annual subscription term, Project Muse will provide, upon 
request, an archival (non-searchable) file on DVD-ROM or 
other appropriate media as determined by Muse, contain- 
ing the content of all issues published online during the 
12-month subscription term." 80 Serving up this kind of data 
will not be plug-and-play for the library and, judging from 
the recent library literature, not enough institutional report- 
ing on local experiences exists (beyond anecdotal electronic 
discussion list postings) to tell if this is a cheaper option than 
paying the publisher an annual maintenance fee. 

Of the publishers included in this survey, 8 commercial 
publishers (Berkeley Electronic Press, Blackwell, British 
Medical Journal [BMJ] Publishing Group, Cold Spring 
Harbor Laboratory Press, Emerald, Lippincott Williams 
and Wilkins, Nature Publishing Group, Springer) and 
8 society publishers (AAAS, ASM, BioOne, Cambridge 
University Press, HighWire Press, Massachusetts Medical 
Society, OUP, and Project Muse) are listed on the LOCKSS 
site as having agreed to make content available through 
LOCKSS. 81 The LOCKSS option is not always stated in 
their licenses. However, just because a publisher does not 
mention LOCKSS in their license does not mean they will 
not consider the option or have not already done so. Silence 
is another reminder of the need for negotiation or for check- 
ing the current list of LOCKSS participating publishers. 
Overall, 16 of 50 (32 percent) of the studied publishers are 
represented as partners on the LOCKSS Web site. In addi- 
tion, 6 of the studied publishers are listed as partners on 
the Portico site. 82 Together with the LOCKSS partners, this 
adds up to 21 of the 50 studied publishers (42 percent) pub- 
licly opting for at least one of these perpetual access mecha- 
nisms. That is a respectable amount, considering that so 
many of these publishers' journals are collected by research 
libraries. These two projects merit consideration as poten- 
tial solutions to the perpetual access problem. Of these 21 
publishers, only 1 (Blackwell) is a partner in both LOCKSS 
and Portico, indicating that 
publishers may opt for only one 
of these approaches. On the 
other hand, 2 major publishers 
included in this study, Elsevier 
and APA, are not LOCKSS or 
Portico partners, which shows 
the need to make sure such 
important publishers have per- 
petual access assurances in their 
standard licenses (as they do) or 
to negotiate their inclusion. 

Of the publishers that allow 
perpetual access, 21.9 percent 
specify that a third party will 
provide the access and deal with 
format conversion issues. The 



numbers for commercial publishers and society publishers 
are very close in this area. Blackwell's wording is typical in 
this regard: 

The Publisher shall . . . provide the Licensee with 
assistance in obtaining continuing access . . . from 
a third party's server provided that the third party 
shall be responsible for any content conversion 
from the format in which the Publisher provides 
the material. 83 

The American Geophysical Union's stance is that any 
third party is solely "for the purposes of long-term preserva- 
tion of the Licensed Materials," and the publisher's server 
and local data loading are the perpetual access options, thus 
making a distinction between preservation of versus access 
to the subscribed content. 84 

Only 1 of the studied publishers names their chosen 
third party in the license; as noted earlier, Elsevier's is the 
National Library of the Netherlands. 85 There are not enough 
specifics in the studied licenses or enough of a track record 
with third-party access to know if this is a valid alternative to 
paying the publisher an annual maintenance fee, or rather 
just an emergency mirror site for such paid access. Care 
must be taken in negotiation to clarify the exact role of tire 
third party and whether the library would really get limited 
access only in an emergency situation, or true perpetual 
access based solely on one's prior subscription history. 

A minority of the publishers who grant some form of 
perpetual access rights present more than one future option 
for access (34.4 percent). Presumably this is at their discre- 
tion, not the library's. For example, Wiley says that it will: 

provide the Licensee with access to the full text of 
the Licensed Electronic Journals published dur- 
ing the Term of this License, either by continuing 



Table 3. Granting publishers' contractual options for perpetual access (n=32) 





Publisher 


Library 


Third party 


Unclear 


Multiple 
options 


Commercial publishers 


9.0 


7.0 


4.0 


5.0 


5.0 


% of all granting publishers 


28.1 


21.9 


12.5 


15.6 


15.6 


% within this publisher type 


50.0 


38.9 


22.2 


27.8 


27.8 


Society publishers 


8.0 


10.0 


3.0 


1.0 


6.0 


% of all granting publishers 


25.0 


31.3 


9.4 


3.1 


18.8 


% within this publisher type 


57.1 


71.4 


21.4 


7.1 


42.9 


Total 


17.0 


17.0 


7.0 


6.0 


11.0 


% of all granting publishers 


53.1 


53.1 


21.9 


18.8 


34.4 
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online access to the same material on Wiley's server 
or by means of an archival copy in the electronic 
medium selected by Wiley (emphasis added). 

Sometimes the choice of format is at the library's discre- 
tion, as with APA: 

APA is committed to providing customers options 
for delivery for site licenses. Currently, those 
options include customer loading, access through 
several vendors, or access directly from APA, and 
there is a separate cost for delivery. The customer 
who has stopped paying for an annual site license 
may also choose one of these options for the seg- 
ment of content for which they retain rights. 8 ' 

Several publishers leave the format intentionally 
unclear. This is more publishers (18.8 percent) than offer a 
third party option. This uncertainty is understandable, given 
how quickly technology becomes obsolete. Slightly more 
commercial publishers than society publishers are unclear 
as to format (27.8 percent versus 7.1 percent). For example, 
the IEEE suggests "a mutually agreeable media type," while 
BMJ offers simply that its journals will be available to lapsed 
subscribers "in electronic form." 88 

Full functionality is yet another concern, no matter 
which party is hosting the content. The Royal Society of 
Chemistry (RSC), despite having the most convincing assur- 
ance with regard to continuing access and multiple formats, 
remains troubling because access to the hypertext markup 
language (HTML) versions of its articles, with reference 
linking, is lost, and only portable document format (PDF) 

so 

versions remain. 

As with cost, this uncertainty as to format makes plan- 
ning and thus budgeting difficult. Will any of these publish- 
ers decide that their own server is not an option and that 
the library must invest in its own infrastructure? Happily, 
for more than half of the studied publishers who allow per- 
petual access, the publisher's server may yet be an option. 

Publisher Continuity Issues 

Few publishers address what will happen if their business 
closes or if they are bought by another publisher — a con- 
cern of libraries in light of such recent events as Elsevier 's 
purchase of Academic Press, Taylor and Francis's purchase 
of Dekker, and Springer's purchase of Kluwer. Elsevier 
guarantees that the National Library of the Netherlands 
testbed archive can be used by institutions in case the pub- 
lisher is no longer able to provide ScienceDirect. 90 Elsevier 
is also is the most explicit of the studied licenses regarding 
the potential transfer of ownership of specific titles to new 
publishers; in such a scenario, Elsevier will try to secure 



access rights to subscribed issues for existing subscribers 
from the new publisher. 91 

RSC commits: "If rights in all or any part of Publisher 
Content are assigned to another publisher, Publisher shall 
use its best endeavours to ensure that the terms and condi- 
tions of this Agreement are maintained." 92 This expression 
of good intentions is well-meaning, but a publisher who sells 
a title or is purchased outright has little power to compel the 
new owner to honor its previous agreements. 

Vagueness of Wording 

Librarians have rightly noted the vagueness and unenforce- 
able nature of the contractual assurances in this area. Some 
illustrative cases: 

Licensor and Licensee shall discuss a mechanism 
satisfactory to die Licensor and Licensee to enable 
the Licensee to have access . . . and the terms of 
such access. 93 

AIP will use reasonable efforts to retain in an 
archive all electronic information published by the 
American Institute of Physics. 94 

What constitutes a discussion? What is a reasonable effort to 
retain an archive? Just because an archive has been retained, 
is it reasonable for a past subscriber to expect access to 
the archive? In fairness, librarians take advantage of vague 
wording themselves to avoid unrealistic obligations or legal 
risk (e.g., the reliance on the word reasonable to character- 
ize the extent of library efforts to inform users of license 
terms). Libraries need to take note when publishers use 
the same tactic in areas where libraries need rights that will 
stand up in court. 

OUP's license has the most striking example of vague 
wording: 

On expiry of the Subscription Period, the Licensee 
shall be entitled to continue to exercise the non- 
exclusive rights granted herein (subject to the 
terms and conditions hereof) but only in respect 
of Material published for the first time during 
the Subscription Period. Nothing in this sub- 
clause requires the Licensor to continue to host 
the Material on its servers after the expiiy of the 
Subscription Period or to make the Material avail- 
able in any other form to the Licensee. 9 

To what are libraries legally entitled here? Arguably, nothing. 
The second sentence negates any entitlement in the first. 

Ovid's contract language demonstrates that, while one 
may be able to obtain a general assurance about perpetual 
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access to a journal package, in cases where multiple pub- 
lishers are part of an aggregator package a provider may be 
unable to consistently assure perpetual access at the journal 
title level: 

Ovid's contracts with nearly all journal publish- 
ers state that when an institution subscribes to 
a journal through Journals@Ovid, the institution 
then has the right to perpetual access to that 
journal. The majority of the journals offered 
through Journals@Ovid include this provision. 
We are optimistic that the few outstanding pub- 
lishers will soon officially adopt this policy as well 
(emphasis added). 96 

Further down in the contract is the caveat that "Ovid's archi- 
val access may be subject to change without notice," which 
calls the whole issue of the enforceability of perpetual access 
guarantees into question. 9 ' At a minimum, one should nego- 
tiate the standard "this contract is amendable only in writing 
signed by both parties" clause, so that one's concerns have a 
chance to be adequately addressed. 

While initially unsettling because the wording appears 
to favor the publisher, such vagueness in license agreements 
is understandable because of the flux in both the perpetual 
access arena (noted earlier in Haworth's license) and the 
technology. Such wording can also be favorable to the 
library. Open-ended wording can give the library an oppor- 
tunity to negotiate more specific terms in the future, by 
which time issues of trust, costs for libraries and economic 
security for publishers, and technology standards may be 
closer to resolution. 

Conclusion 

To obtain useful contractual guarantees of ongoing access 
to purchased electronic content, librarians must work with 
content providers and communicate with peers to create 
and develop robust license language and stable options and 
procedures for perpetual access to subscribed material. The 
primary finding of this license survey is encouraging. The 
majority of individual publishers studied are willing to grant 
libraries perpetual access rights in some form, often con- 
tinuing access through the publisher's server. 

If libraries accept e-journal licenses in their generic 
form (often with no guaranteed perpetual access clause) and 
sign them without negotiation, libraries and their patrons 
risk losing access in the future should the online subscrip- 
tion be canceled. What seems unimaginable now might be 
very real a few budget cycles into the future. This conclusion 
may seem like 20/20 hindsight and not an option for those 
who have already signed restrictive licenses. Nevertheless, 



annual renewals, changes in subscription levels (e.g., from 
a complete package to a limited, title-by-title package), 
entry into a new consortial deal, and publisher notifications 
that they have revised their license for subscribers present 
opportunities to renegotiate a license. 

The experience of the authors' institutions shows that 
perpetual access rights can be gained by asking, and that 
the effort can pay dividends for users. The University 
of Minnesota has availed itself of Elsevier's perpetual 
access rights clause. When the university canceled some 
ScienceDirect titles in 2003, the library was left with 
1998-2002 access to these titles according to its license. 
The titles are not just linked with specific coverage dates 
in the library's MNCAT catalog, but are also activated in its 
link resolver program for those dates, so users can get to the 
titles either directly through ScienceDirect, or indirectly 
through other databases that cite articles in them. This 
example shows the importance to users of perpetual access 
to a canceled title. Library users do not care at the moment 
of need whether a library has a current subscription to a 
title; their concern is whether they can access the article 
they need right at that moment. 

Anecdotal evidence from licensing librarians, such as 
the postings to ERIL-L and Liblicense cited earlier, sug- 
gests that some publisher sales representatives decline 
license requests for perpetual access on the basis that 
the demand from other institutions does not exist. More 
institutions asking for perpetual access rights can only 
help libraries' collective case. But, as standard negotiation 
wisdom agrees, one has to be willing to walk away from a 
deal. Academic libraries proved the potential efficacy of 
this approach if employed in significant numbers when, in 
2001, Nature Publishing Group proposed an embargo on 
current issues of Nature (and subsequently offered a revised 
license without the embargo). 98 After the license review for 
this paper was completed, 'Nature reversed itself and began 
offering perpetual access rights to lapsed subscribers for an 
annual access fee, in what may be in part a response to die 
UC's aforementioned unwillingness to sign its renewal with- 
out such assurances. 99 

If libraries are successful in negotiating perpetual 
access clauses in e-journal licenses, they still may be faced 
with uncertain ongoing costs to maintain such access — per- 
haps substantial costs, even if they are not annual content 
subscription fees. Libraries also may be faced with the need 
to load and serve up the content locally, a practice most 
libraries have been trying to discontinue since at least the 
late 1990s. If libraries cancel print journals carried in large 
full-text aggregator databases, they risk losing online access 
in the future, as few of these databases analyzed here allow 
for perpetual access upon termination in their agreements. 
Due to the varying coverage between aggregators, switch- 
ing between large e-journal databases, such as Expanded 
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Academic Index and Academic Search Premier, can result 
in a significant loss of full-text journal coverage. 

A research library would be wise not to rely too much on 
the prospect of obtaining once- sub scribed articles through 
ILL. As with perpetual access rights, many libraries do not 
negotiate or succeed in inserting language protecting their 
ILL rights in licenses, resulting in what some have termed 
the "death of ILL" — institutions unable to loan articles from 
licensed databases due to contractual restrictions. 100 If more 
institutions go e-only via licenses with ILL prohibitions, such 
restrictions on lending will affect access for peer institutions. 
A 2003 survey of ILL offices in CIC and their e-resource 
licenses found that most CIC member libraries agreed that 
ILL rights were "important but not the first consideration" 
and that most libraries encountered cases when "ILL was 
prohibited entirely [and] when the delivery mechanism was 
restricted, such as in electronic delivery." 101 

The recent research report from CLIR, The Non- 
subscription Side of Periodicals, proposed that perpetual 
access can be paid for, at least in part, by the cancellation 
of print, and the resulting savings to be accrued from the 
nonsubscription side of periodical operations (e.g., the 
binding, shelving, and maintenance of print volumes). 102 
CLIR rightly warned, though, that "some provosts might 
argue that savings should be returned to the general fund 
rather than be redirected within the library," so the library 
will need to educate administrators and funders about the 
real costs of continued electronic access, including setup 
and maintenance fees for the publisher or third party and 
local data loading costs. 103 Note, however, that CLIR did 
not mention the fact that many publishers do not offer 
perpetual access and, upon cancellation of the print copy 
from such a publisher, no guaranteed copy would be avail- 
able. 

The International Coalition of Library Consortia 
(ICOLC) declared that, as "permanent access and archiving 
are of paramount importance," licenses "must include cost- 
effective provisions to purchase and not just to lease or 
provide temporary access" and recommends that libraries 
and consortia "explore new options," such as working with 
publishers to create national and regional repositories. 104 
The question is whether libraries and consortia adopting 
this statement will interpret must literally and refuse to sign 
agreements without such guarantees. 

Publishers cannot be expected to solve the problem of 
perpetual access on their own, but they are participating in 
the definition of roles, however unclear those roles currently 
are. Blackwell representatives believe that "the 'archival' 
role of the publisher is ill-defined but shifting from finite 
to that of open-ended responsibility"; they define appropri- 
ate publisher roles as ensuring that content is available in 
the "short to medium-term," protecting subscribers from 
"harmful changes," and "[working] with key stakeholders" to 



make content available in the long-term. 105 Libraries are one 
of these stakeholders and should partner with publishers as 
ICOLC recommends. 

While institutional repositories are important develop- 
ments, drey alone cannot be expected to solve the problem 
either, at least in the short term. Even if faculty can be con- 
vinced to deposit their research output in such repositories 
(perhaps in addition to the versions available at commercial 
journal sites), these projects are still in their infancy and need 
to be tested. In a recent overview of Ohio State University's 
(OSU) institutional repository project, the Knowledge Bank, 
OSU's director of libraries Branin projected that the project 
"will likely be underway for five to ten years before it is 
mainstreamed or institutionalized at the University." 106 

Concluding Recommendations 

Drawing on the results of dris license analysis and on the 
review of the literature on perpetual access, the authors 
offer the following recommendations: 

1. Libraries should not assume that they may safely can- 
cel the print version of a journal published by a society 
publisher and retain online access to subscribed issues 
if the electronic version is later canceled. Based on 
the research reported in this paper, society publishers 
are not more likely to provide perpetual access to sub- 
scribed articles than are commercial publishers. 

2. Libraries should not assume that they can safely cancel 
the print version of a journal included in a full-text 
aggregator database and retain access. 

3. Libraries should consider making the lack of perpetual 
access rights a deal breaker (i.e., a valid justification 
for not signing air e-journal license). This is especially 
important if the library is also canceling a print ver- 
sion. The University of Maryland and the University 
of California-Berkeley (cited in the introduction to 
this paper) offer helpful precedents for large research 
libraries. The same access terms should apply during 
perpetual access as during subscription access (e.g., 
fair use, rights of walk-in users, remote access). This 
may be a difficult stance to take for journals that are 
considered so central to a collection that one would 
never cancel them. Holding firm for the inclusion of 
perpetual access rights is where consortia, which bro- 
ker many of an academic library's major e-journals, can 
bring their considerable economic leverage to bear. 

4. Libraries should find a way to budget for perpetual 
access. These can be one-time purchase fees (e.g., back 
files from Annual Reviews or Wiley) or annual access 
or maintenance fees (e.g., established packages, such 
as the ACS Journal Archives and JSTOR), or might be 
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a combination of the two (e.g., third parties such as 
Portico). Care must be taken to ensure perpetual access 
rights to current issues in the future even if a back-file 
package is separately purchased. Many packages only 
go back to the mid- to late-1990s, and back files are 
not automatically added to one's package but must be 
licensed separately and are not always available as one- 
time purchases. Librarians must recognize that such 
fees are often justifiable requirements from publishers. 
As libraries give up the binding, shelving, and mainte- 
nance costs of print journals, publishers of e-journals 
have delivery costs that must be borne. Related to the 
cost issue, the profession should advocate for and par- 
ticipate in the development of standardized, affordable 
pricing models for such access. Here, too, consortial 
purchasing power can play a role. 

5. Libraries should explore the potential for trusted third 
parties to host journals for libraries that have sub- 
scribed to them in the past. 

6. Libraries should include wording about publisher 
mergers in perpetual access clauses. 

7. Libraries should ask their legal counsel whether the 
publisher's perpetual access clause is clear and specific 
enough to stand up in court. 
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Appendix. Publishers and Aggregators Whose Licenses Were Analyzed 



Commercial Publishers 

Berkeley Electronic Press 
Blackwell 

BMJ Publishing Group 

Cold Spring Harbor Laboratory Press 

De Gruyter 

East View Information Services (China Academic 

Journals) 
Elsevier 
Emerald 
Haworth Press 

Hindawi Publishing Corporation 
Karger 

Lippincott Williams and Wilkins 

Mary Ann Liebert 

National Journal 

Nature Publishing Group 

Palgrave Macmillan 

SAGE Publications 

Springer (includes Kluwer) 

Taylor & Francis Group (includes Dekker) 

Wiley 

Society Publishers 

American Association for the Advancement of Science 

(AAAS) 
American Chemical Society 
American Geophysical Union 
American Institute of Physics* 
American Mathematical Society 
American Medical Association 
American Psychological Association 
American Society for Microbiology 



American Society of Civil Engineers 
American Society of Mechanical Engineers 
Association for Computing Machinery 
CAB International 
Cambridge University Press 
IEEE 

Massachusetts Medical Society (New England Journal 

of Medicine) 
Oxford University Press 
Boyal Society 

Boyal Society of Chemistry 
University of California Press 
University of Chicago Press 

Commercial Aggregators 

EBSCO Industries 
Gale 

H. W. Wilson 

Ovid 

ProQuest 

Society Aggregators 

BioOne 
Highwire Press 
JSTOB 

OCLC (Electronic Collections Online) 
Project Muse 

*American Institute of Physics (AIP) Scitation; the 
authors included AIP as an individual publisher (not 
an aggregator) because its hosted publishers have their 
own licenses (e.g., American Society of Mechanical 
Engineers) 



110 



LRTS 50(2) 



Bhagirathi Subrahmanyam (bsubrahm@ 
kent.edu) is Assistant Professor, School of 
Library and Information Science, Kent 
State University, Columbus, Ohio. The 
author wishes to thank the editor, the 
reviewers, and Elaine Svenonius for their 
valuable comments and suggestions. 



Library of Congress 
Classification Numbers 

Issues of Consistency and Their 
Implications for Union Catalogs 

Bhagirathi Subrahmanyam 

This study examined Library of Congress Classification (LCC)-based class 
numbers assigned to a representative sample of 200 titles in 52 American library 
systems to determine the level of consistency within and across those systems. 
The results shoived that under the condition that a library system has a title, the 
probability of that title having the sa me LCC-based class number across library 
systems is greater than 85 percent. An examination of 121 titles displaying varia- 
tions in class numbers among library systems shoived certain titles (for example, 
multi-foci titles, titles in series, bibliographies, and fiction) lend themselves to 
alternate class numbers. Others were assigned variant numbers either due to 
latitude in the schedules or for reasons that cannot be pinpointed. With increas- 
ing dependence on copy cataloging, the size of such variations may continue to 
decrease. As the preferred class number with its alternates represents a title more 
fully than just the preferred class number, this paper argues for continued use 
of alternates by library systems and for finding a method to link alternate class 
numbers to preferred class numbers for enriched subject access through local and 
union catalogs. 

As long ago as 1968 the possibility of searching for resources in online catalogs 
through assigned call numbers as access points was raised. 1 A call number is 
a notation that uniquely identifies an item within a collection and on the shelf. It 
is a composite number containing a class number that indicates the class that its 
subject belongs to as per a classification scheme, such as the Library of Congress 
Classification (LCC) scheme, and a book or an item number that uniquely 
identifyies an item within that class, derived usually from the author's name and 
other physical features (e.g., volume number). 2 Today that possibility has been 
realized; many library catalogs provide call number searches. Some even allow 
searching by the class number part of the call number. For instance, OCLC's 
WorldCat, originally a cataloging resource, is becoming increasingly accessible 
to the public and allows searching and browsing by the class number part of the 
assigned call number. Moreover, networked resources are often assigned class 
numbers using the same classification schemes that are being used by the librar- 
ies to classify their resources. 3 

A searcher who chooses to search by call or class number can reasonably 
expect all institutions to have the same class number for a particular title, given 
that the same classification scheme was used as a source for class numbers, and 
only one class number was assigned, even for a multi-foci title. In this context, 
the objective of this study was to ascertain the degree of uniformity of class 
number assignment that exists within libraries and across different libraries and, 
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where deviations exist, to analyze the reasons for deviations 
and their implications for local and online catalog design in 
general, and in academic library catalogs in particular. 

Background 

The many advantages of call numbers in general, and class 
numbers in particular, as access points over author and title 
access points have been well-documented. 4 Unlike vocabu- 
lary searching, call number searching is precise and unam- 
biguous. Unless a cataloger provides for various vocabulary 
approaches, such as variant spellings and synonyms, a user 
may miss relevant items. In contrast, call numbers consist of 
unambiguous sequences of letters, numerals, or both. There 
are no two ways to represent the same class number. Thus, 
call numbers are more precise than vocabulary terms as 
search tools. Moreover, they are becoming more available as 
search tools as more and more network resources are being 
classified using general library classification schemes. 

More importantly, call numbers serve as starting points 
for browsing shelves. Arrangement by call numbers on the 
shelves results in die collocation first by class numbers and 
then book numbers. The underlying principle of any general 
library classification scheme from which class numbers are 
derived is the principle of collocation, which aims to bring like 
tilings together by dividing the universe of knowledge hierar- 
chically into classes and subclasses. Users who know a general 
class number can browse the shelf in any library without going 
to a catalog under die assumption diat like items will be 
grouped togedier by class number. Such shelf browsing, from 
general to specific and vice versa, also can be simulated in an 
online catalog. A user can browse a catalog using either the 
call or class number of a known item wherever he or she is, at 
home or in a library. Such classificatory access helps overcome 
the problem of browsing a divided collection by simulating 
shelf display online, irrespective of the physical location of 
material. However, browsing will be meaningful in an online 
catalog only if die class number assignment for a given topic is 
consistent within and across libraries. 

Contributing to inconsistency in the assignment of call 
or class numbers is that their collocation function is often 
overlooked in favor of using them as mark-and-park devices 
for shelf arrangement. Some materials (e.g., videotapes) 
may not be assigned subject-based class numbers. Libraries 
may find it difficult to keep up with changes that occur in 
classification schemes. 3 As Taylor noted in a study on copy 
cataloging in libraries, many librarians find keeping up with 
the successive editions of classification schemes difficult, 
especially when changes between editions was drastic in 
the form of complete revisions or new editions. 6 In order to 
keep pace with the changes in classification, libraries must 
have die resources — time, finances, and personnel — to ret- 



rospectively classify collections to reflect the changes as and 
when diey occur. In the absence of such resources, libraries 
must reconcile themselves to giving items on the same topic 
two different class numbers. Older collections retain class 
numbers from an earlier edition of the scheme, while newly 
acquired materials are assigned class numbers from the lat- 
est edition of the scheme. This can result in inconsistency of 
class numbers across libraries.' 

One would expect that centralized agencies that pro- 
vide catalog entries to member libraries for copy catalog- 
ing, such as OCLC, and such programs as Cataloging in 
Publication (CIP) would help to standardize the process 
of class number assignment across libraries, but this is not 
always the case. Although OCLC records generally carry 
both LCC and Dewey Decimal Classification (DDC) num- 
bers, records may be lacking one or both. For example, 
OCLC records created before 1968 in the field of law may 
lack Library of Congress call numbers because the Libraiy 
of Congress schedule for law was not available then. In the 
absence of call numbers found on OCLC records, libraries 
have to assign their own call numbers, and these may vary 
from libraiy to library. 

Several studies have investigated the connection 
between copy cataloging and assigned call numbers. Younger 
and Nichols found that many libraries make minor adjust- 
ments to the call numbers available on cataloging copy for 
one or more of the following reasons: (1) to match the num- 
bers of earlier collections; (2) to meet the special needs of a 
given segment of the collection; (3) to add Cutter numbers 
or prefixes; (4) to avoid shelf-list conflicts; (5) to bring like 
subjects togedier; (6) to make the copy fit the latest edition 
of the classification scheme; (7) to fix errors on the copy; and 
(8) to use alternative numbers that fit the local collection 
better. 8 Circumstances such as these result in the separa- 
tion of books on the same topic on the shelf. As a result, call 
numbers tend to become mere mark-and-park devices. 

O'Neill and McCain, in a joint OCLC-Library of 
Congress project, conducted a study of library Cuttering 
practice to learn the extent and types of changes libraries 
made to call numbers, particularly Cutter numbers. 9 Using 
a sample of 895 records created from October 1994 to May 
1995, O'Neill and McCain compared the DDC call number 
from the OCLC record and the final call number assigned 
by the member library that used that record for copy cata- 
loging. They concluded that almost 80 percent of die DDC 
class numbers assigned by die Library of Congress were 
accepted by the local library and another 10 percent were 
accepted with minor revisions. 10 

In another study, Massey and Malinconico examined the 
University of Alabama's originally assigned call numbers for 
shelflisting errors, such as wrong class or wrong Cutter num- 
ber. 11 They examined items cataloged from October 1990 
through March 1995 to determine whether the differences 
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in shelflisting errors between these call numbers and the 
class numbers copied from OCLC were significant enough 
to justify investment in original cataloging. They concluded 
that revising call numbers from a copy for local use was not 
a cost-effective operation, at least for their university. They 
did not study the nature of variations and die reasons for 
such variations. 

Methods 

The objectives of the present study were to answer the fol- 
lowing two research questions: 

1. Among academic library systems using the LCC 
scheme for classifying items, does a given title (where 
title can be defined as the distinguishing name of a 
work) have the same class number across different 
library systems? 

2. What are the reasons for any observed variations in 
assigned class numbers for the same titles? 

The study focused on the class number part of the call 
number, as it is generated from a standard classification 
scheme to represent die subject matter of the title under 
consideration. The book number part, which is derived 
locally by individual libraries and thus subject to a high 
degree of variation, was not considered. LCC-based class 
numbers were chosen because the LCC scheme is widely 
used in academic libraries and, as an enumerative scheme, 
provides little idiosyncrasy in assigning class numbers. 
Academic library systems were investigated because a 
majority of them use the LCC schemes and their catalogs 
were, for the most part, Internet-accessible. 

Before collecting data, the following decisions were 
made. First, class numbers of a title would be compared with 
reference to a specific edition. That is, only class numbers of 
a title with a specific imprint (e.g., New York, MacMillan, 
1958) would be compared across various library systems. 
Second, for each title in die sample of titles, the class number 
that occurred most frequently among libraries under inves- 
tigation was defined as the standard class number (SCN). 
Third, each university library system, whether it has one or 
more than one branch, was considered to be a single entity. 
Finally, an imprint-specific class number was considered a 
match if and only if die title in all branches of a library system 
exhibited the same one. Even if the class number differed in 
only one branch, a non-match would be recorded. 

The method research included the following steps. A 
list of 52 large, American university library systems that 
use LCC as the main scheme for their collections and 
whose online catalogs were available in the public domain 
was compiled from the list of the Association of Research 



Libraries' member libraries (see appendix A). The cur- 
rent list has 113 university libraries in the United States 
and Canada; when this project was started in 1996, only 
107 library systems were members. From that list of 107 
library systems, systems in Canada and those not using LCC 
were eliminated. In the end, only 52 library systems whose 
catalogs were available on the Internet formed the library 
sample. The possibility of finding a given title in a large 
library system is higher than in a smaller library. 12 

The six-volume Books for College Libraries was used as 
a source for monograph titles to be sampled for this study. 13 
This publication serves as a retrospective book selection tool 
for undergraduate libraries. The author assumed that a well- 
stocked academic library system will have at least one-third 
of the titles listed in it, and that large university libraries will 
contain a majority of the items listed. 

First, a pretest was conducted using 20 randomly select- 
ed monographic titles from Books for College Libraries by 
collecting class numbers for 20 edition-specific titles. Due 
to the lack of literature on this topic, there was no way to 
predict the probability diat a title will have a consistent class 
number across library systems without a pretest. Such an 
estimate was needed to formulate the hypothesis for testing 
in order to answer the first research question. 

The probability estimate for a title to have the same 
class number across library systems (L. j = 1,2. 52) was cal- 
culated as 0.88 using the formula: 

52 

X Total number of matches in L 



X Total number of titles available in L 

H ' 

Using a conservative value of p = 0.85, the following 
hypothesis for testing was constructed in order to answer the 
first research question: The probability of a title having the 
same class number across library systems is greater 0.85. 

From this chosen probability estimate p = 0.85, the 
minimum sample size of titles "n" was determined using 
the formula, n = z 2 p (1 - pj / e 2 , where z is the standard 
normal variable and e die allowable margin of error. Setting 
the level of significance a = 0.05, e = 0.05, z 00 _ = 1.645, p = 
0.85, 1 - p = 0.15, yields "n" to be 138. Because die first five 
volumes of Books for College Libraries were used (excluding 
the sixth volume — an index), this figure of 138 was rounded 
to 200 titles so that an equal number of 40 titles could be 
selected from each volume. 

In order to test die hypothesis, the required sam- 
ple of titles was randomly chosen from Books for College 
Libraries. Each volume has more dian one section. For 
example, volume 1 (Humanities) consists of five sections — 
General Works, Philosophy, Religion, Music, and Fine 
Arts — covering 4, 64, 89, 51, and 107 pages respectively. In 
this volume, 40 titles were selected by randomly choosing 
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1, 8, 11, 6, and 14 tides, respectively, from the previously 
mentioned five sections based on the relative sizes of their 
listings. The sample of target titles was adjusted to increase 
comparability and achieve representation. For instance, one 
title in die representative sample, Burton's Narrative of a 
Pilgrimage (1969), was available in only one library system; it 
was not possible to make a comparison of class numbers. It 
was replaced by its earlier 1964 publication, which was held 
by several library systems. The year of publication for die 
sample titles ranged from 1911 to 1987. The sample titles 
did not include any titles published in the 1990s because the 
most recent edition of Books for College Libraries was pub- 
lished in 1988. To overcome this lack of representation, the 
author decided to find out whether any of the sample titles 
had editions published in the 1990s. Twenty-one such titles 
were found, and the later editions were used for compari- 
son. The holdings of these 21 titles of 1990s editions were 
compared with those of their earlier editions. For 14 of those 
tides, earlier and later editions were found to be more or 
less the same. For example, 41 library systems held the 1975 
edition of Becker's Human Capital, the original sample title, 
as opposed to 43 that owned its 1993 edition, leading to the 
replacement of the older edition by 1993 edition. In other 
words, a replacement was made based solely on whether a 
library system has at least one copy of the item and nothing 
else; the quality of assigned class number was not even an 
issue at this stage. The year of publication for the 200 titles 
dius adjusted ranged from 1911 to 1995, each year being 
represented by at least one title. The class number for each 
title across each library system was recorded using Microsoft 
Excel. Then for each imprint-specific title, die SCN was 
identified. SCN matches were recorded as 1, non-matches 
was recorded as 0, and the results were tallied in a title data- 
base; this listing is available from the author upon request. 

Analysis— Hypothesis Testing 

Using the data in die title database, the following hypothesis 
was tested: 

The probability p of a title having the same class num- 
ber across library systems is greater than 0.85: 

H : p = 0.85 
Eq: p > 0.85 

H () can be rejected if the T statistic given by: 



^ (0.85X1-0.85)^ 

where p is the probability estimate from a representative 
sample, z is the standard normal variable, the level of 



significance pis 0.05, and z 005 = 1.645 (from the Normal 
Distribution Table). Probability estimate p was calculated 
using the values from Title-Database as: 

200 

X Total number of libary systems having a match for T. 

H * 

200 

X Total number of library systems holding T 

H 

= 6404 / 7303 = 0.91. Inputting N = 7303 and p in Equation 
A: 

0.91 - 0.85 
T = . = 14.13. 

V (0.85)(0.15)/ 7073 

As T = 14.13 is greater than z QS = 1.645, H Q (p = 0.85) is 
rejected at the 0.05 significance level. Consequently, the 
probability that title has the same class number across 
library systems is greater than 0.85, implying that more 
than 85 percent of library systems will have the same 
class number for the same title, which answers the first 
research question. 

Analysis of Non-Matches in the 
Sample Title- Database 

Of the 200 titles in the tide sample, only 79 displayed a 
perfect match to the SCNs across different library systems 
(see title database). The remaining 121 titles exhibited 
non-matches; the extent of the non-matches across library 
systems ranged from 2.0 to 71.4 percent, depending on the 
title. A class number differing from the SCN for a title (i.e., 
a non-match) can be merely a variant, meaning a number 
different from the norm or an alternate, or a number that is 
a legal substitute for the norm. 

Variation can occur in any part of an SCN. A class num- 
ber for a title consists of, at most, three parts. The first part 
is the alphabetical part, which indicates the main class to 
which the title belongs; the second part (the numerical part 
following the alphabetical part) indicates the specific topic of 
the title under that main class; and the third, a Cutter num- 
ber, which may or may not be added, is used to bring out an 
aspect, form, period, place, or subtopic of the subject. An 
alternate or variant class number, one that does not match a 
SCN, can display variation in any one of these three parts. 
For each title, the alternates and tiieir respective SCNs were 
compared to identify where variation occurred. 

Those parts of a title that generally contribute to die 
design of a class number — such as author; subject headings 
as generated from the LCSH list; the form of the title, such 
as biography, bibliography — and the appropriate editions 
of the Library of Congress schedules were examined in 
order to find out whether any of these could explain the 
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presence of alternate or variant class numbers for a title. 
On the assumption that titles are acquired and processed by 
libraries soon after their publication using the then-available 
edition of the schedule, non-matching class numbers were 
analyzed using valid editions of appropriate schedules. For 
example, non-matches for Borden's Robert Laird Borden 
(1938) were analyzed using the second edition (1913) of F 
schedule. The third edition was published only in 1958 and 
was reprinted in 1965. Such an analysis led to identification 
of four groups of titles: 

• Group I: Titles that received alternate class numbers 
(37 titles). 

• Group II: Titles that were assigned variant class num- 
bers due to latitude in the schedules (34 titles). 

• Group III: Titles that were assigned variant class 
numbers for reasons unknown (42 titles). 

• Group IV: Titles that were assigned both alternate 
and variant class numbers (8 titles). 

Group I: Titles with Alternate Class Numbers 

The 37 titles with alternate class numbers fall into the 8 
categories described below. 

• Titles with multiple foci: When a title covers more 
than one subject, it is given multiple subject head- 
ings. From these multiple subject headings, a library 
can choose as a basis for the class number assignment 
one not favored by a majority of libraries. Sometimes 
the subject headings for a title may correspond to 
the same main class but point to different subtopics 
under that class. Sometimes they may indicate dif- 
ferent classes. For example, in the case of Bowen's 
Protestants in a Catholic State: Ireland's Privileged 
Minority (1983; 56 percent, 4.19), 22 library systems 
chose HN400.3.A8 (Social history and conditions, 
Ireland). An equal number of library systems chose 
the alternative BX4839 (Beligion, Protestantism, 
Ireland). (Parentheses for each title include date of 
publication, percentage of library systems with non- 
matches, and serial number in the title database). 

• Titles that belong to monographic series: These 
titles present a dilemma to the classifier: should 
the class number reflect the series to which the 
title belongs, or the contents of that specific title? 
There seems to be no uniform policy among library 
systems regarding the assignment of series versus 
analytical class numbers. Consequently, there is a 
considerable inconsistency in assigned numbers for 
titles in series. For instance, the title Environmental 
Physiology (1974; 45.5 percent, 5.28) by Bobertshaw 
is volume 7 of series titled Physiology Series One. Its 



SCN, QP1 (Physiology, periodicals, societies, serial 
collections), classed it with the series, while the 
predominant alternate number QP135 (Physiology, 
Body temperature) classed it according to its con- 
tents as indicated by the subject heading Body tem- 
perature-Begulation. 

• Literary works of colonial authors: For works of 
authors from ex-colonies, the PB (English Literature) 
schedule provides options for two types of class num- 
bers. They can either be classed as "individual authors 
flourishing after 1960s," or as "authors from colonies, 
provinces etc." Naipaul's A House for Mr. Biswas 
(1961; 41.7 percent, 2.22) and Birney's Spreading 
Time: Remarks on Canadian Writing and Writers 
(1980; 16 percent, 2.24) exemplify this problem. In 
both cases, the SCN classed them with respect to their 
geographical locations, while the alternates classed 
them as individual authors flourishing after 1960. 

• English fiction titles (cataloged before 1980): Until 
July 1, 1980, all works of fiction in English were 
classed in PZ (Fiction in English) at the Library of 
Congress. The PZ schedule consists of two parts: 
PZ1-4 (Fiction in English) and PZ5-90 (Juvenile 
Belles Lettres). Because such an arrangement sepa- 
rated British and American fiction from the rest 
of the British and American literature as well as 
English translations of foreign fiction from their 
originals, the Library of Congress changed its policy 
to class American fiction with PS, English fiction with 
PB, and translations into English with the original 
national literature. Previously, all juvenile nonfiction 
was also classed in PZ (Juvenile Belles Lettres). Now 
all juvenile nonfiction is classed with the appropriate 
subject areas. For example, the SCN of Opie's Oxford 
Dictionary of Nursery Rhymes (1951; 48.6 percent, 
2.14) was PZ8.3 06 (Juvenile Belles Lettres, Verses 
for children) while the alternate was PN 6110.C4 
(Poetry for children). 

• Titles belonging to specific literary form, such as 
poetiy: They can be classed with the form schedule 
or with individual authors. Example: The Dangerous 
Edge (1975; 4.2 percent, 2.23) by Lambert covering 
history and criticism of detective and mystery stories 
had PN3448 D4 (Prose, non-historical novels) as its 
SCN while the alternate was PS3562 A44, a class 
number for an individual author from the American 
Literature schedule. 

• Memoirs: Memoirs can be classed either with lit- 
erature or with the subject area of the memoirs. One 
title illustrated this: Sassoon's Memoirs of an Infantry 
Officer (1930; 42.3 percent, 3.17) with the subject 
heading World War I — Personal Narratives, English 
was assigned D640 (First World War, Personal 
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narratives and other accounts) while alternates 
PR6037.A65, PR6037.S11, PR6037.A86, PR6037. 
A84, and PR 6037.A1 were taken from the individual 
authors section earlier. The alternates also displayed 
variations in Cutter numbers for reasons discussed 
under group II titles. 

• Bibliographies: Should a bibliography be classed 
under Z (Bibliography), or under the class number 
for the bibliography's subject? There seems to be no 
uniform answer to this question. Thus a bibliography 
is assigned a Z-based number by some libraries, while 
others may prefer to classify it under its subject. 
There were 3 bibliographic titles with 9.8 percent, 
14.9 percent, and 13.7 percent non-matches; in all 
3 cases the Z-based class number was the SCN. 
The alternates in each case were the subject-based 
numbers. Example: Griffin's Latin Anwrica: A Guide 
to Historical Literature (1971; 13.7 percent, 5.26) 
had as its SCN Z1601 (National bibliography, Latin 
America) and as alternates F1401.5, F1406, F1408, 
and F1410 (F1401-F1418: History, Latin America) 
of the English literature schedule. The alternates 
also displayed variations in Cutter numbers due to 
reasons discussed under Group II titles. 

• Titles that may fall into more than one category with 
respect to expression: Initially, the subject specialists 
who designed the LC schedules made use of seven 
divisions known as Martel's points: (i) General form 
divisions, (ii) Theory & philosophy, (iii) History, (iv) 
Treatises, general works, (v) Law, regulation, state 
relations, (vi) Study and teaching, and (vii) Special 
subjects and subdivisions of subjects progressing 
from the more general to the specific. 14 A given title 
can fall into more than one division, and it is possible 
for one library to favor one division over the other. 
Unlike the DDC scheme, LCC does not have a 
preference order for assigning class numbers in such 
cases. So, for edited works such as Grzimek's Animal 
Life Encyclopedia (1972; 19.6 percent, 5.30), which 
exhibited the highest non-match percentage for this 
category, the majority of libraries preferred QL3 
(Zoology, collected works), while others preferred 
such class numbers as QL7 (Zoology, Encyclopedia) 
or QL45 (Zoology, general works, 1760-1969). 

Group II: Titles That Were Assigned Variant Class 
Numbers Due to Either Latitude in the Schedule 
Itself or Lack of Enumerated Numbers 

For biographies associated with a specific subject (exclud- 
ing literature), changes may occur in the Cutter parts of 
their class numbers due to latitude in the schedule. For 
biographies, under the appropriate subject, an A-.Z range 



is provided by LCC scheme for the classifier to generate a 
Cutter number unique for die biography. It is the responsi- 
bility of the classifier to generate the Cutter number unless 
the subject of the biography is well-known, in which case the 
LC schedule will provide an enumerated number. Variation 
in generating Cutter numbers produces a corresponding 
variation in class numbers. One example is Mandel's Samuel 
Gompers, A Biography (1963; 14.6 percent, 4.25), which 
was assigned HD8073 G6 by a majority of libraries, where 
HD8073 refers to Labor, Biography and G6 is the Cutter 
number for Gompers from the LC Cutter Table. All the 
alternates for this title occurred in the Cutter area, namely 
G63, G5, G634, and G586. 

In the literature schedules, a provision is made for clas- 
sifying works by and about individual literary authors by 
means of individual author numbers. Again it is the classifi- 
ers' responsibility to extend the appropriate literature num- 
ber by providing a Cutter number to uniquely identify the 
author in question. To uSis end, the classifiers have to use 
the elaborate Author Tables. For example, Wykes's Triad 
of Genius (1953; 28.6 percent, 2.28) is a book consisting of 
the collected works of Edith Sitwell. Its SCN was PR6037 
183. The PR6037 part of the number stands for English 
literature, 20th century, individual authors starting with S. 
The Cutter number 183 was derived from the Author Table. 
Variants in the Cutter number were as follows: 1898, 188, 
19, S62 (probably from the Cutter-Sanborn table), 185, 18, 
and It757. 

The Literature schedules also provide ranges of num- 
bers to accommodate the individual works of prolific authors. 
Differences can occur in the selection of numbers within 
these ranges, thus generating variations in the numerical 
part of a class number. Example: Selected Letters of Samuel 
Richardson (1965; 41.4 percent, 2.34) for which the classi- 
fier is provided with numbers in the range PR3660-3667 
for Richardson as author. Here a majority of libraries chose 
PR3664, while variants included PR3666, PR3667, PR 3666 
A51, and PO 3666 (possibly a clerical error). 

Numbers for classical philosophy exist in bouS the B 
and PA schedules. Even though PA class numbers are used 
only for original Greek and Roman texts, some libraries used 
them also for translations, which should be classed under B 
religion. For example, the SCN for Aristotle's Metaphysics 
(1976; 6.5 percent, 1.29) was derived from the Philosophy 
schedule, while the variants classed it under Greek litera- 
ture. The same discrepancy was exemplified by a nonclassi- 
cal work. For Fowles's Aristos (1970; 7.5 percent, 1.28), the 
SCN classed it with Philosophy, while the variants classed it 
with English literature. 

Four titles demonstrated that schedules J and K are 
capable of producing alternate numbers. La Nauze's The 
Making of the Australian Constitution (1972; 71.4 per- 
cent, 4.27), which exhibited the highest percentage of 
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non-matches (72 percent) among all titles in the sample, 
fell into this category. Its SCN was JQ4011 (Constitutional 
history Australia). More than 20 library systems had alter- 
nate class numbers, none matching any other in K. This 
was probably due to there being no LC law schedules at 
the time the book was classified, so the libraries under 
investigation had to use locally designed K schedules. The 
same was true of Goldfarb's Contempt of Court (1963; 
34.1 percent, 4.26), which received both JK and KF class 
numbers. In the case of such titles, the SCNs and the vari- 
ants were completely different. 

Titles dealing with women's issues also exhibited dual 
numbers. There were two such occurrences: Oilman's 
Women and Economics (1970; 25 percent, 4.31) and 
Women and Development: A Resource Guide (1984; 19 
percent, 4.32). In both cases, the first subject heading was 
Women-Economic Conditions. For Oilman's work, the 
SCN was HQ1426 (Woman, Feminism, Reform literature, 
Later I860-) while the more specific variant HQ1381 
(Women and Economics) was given by 4 libraries. The 
problem with the second title was unique — it had no CIP 
record, and the LC-assigned class number on the OCLC 
record was HQ1240.5 (Feminism, Women and state). 
The Books for College Libraries had HQ 1154 (Woman, 
Feminism, Emancipation of Women, 20th century), which 
was assigned by seven library systems, but the SCN was 
HQ1870.9 (Woman, Feminism, Underdeveloped areas). 

Sometimes the LC schedules provide Cuttering for sub- 
topics. In such cases, variations may occur in the Cutter part 
of the class number. The HD (Economic history) schedule 
has listings of special industries and trade. For some indus- 
tries Cutter numbers are enumerated, while for others they 
are to be chosen from a range. In the example of Glasner's 
Politics, Prices, and Petroleum (1985; 3.9 percent, 4.30), the 
SCN was HD9579 G5, where G5 stands for Gasoline. For 
the non-matches, the variations occurred in the Cutter part 
of the class numbers — for example, G61 and G41. 

Group III: Titles That Were Assigned Variant 
Class Numbers for Reasons Unknown 

Generalizing about the source of variation in class num- 
bers was impossible with 42 titles. Individual libraries may 
have their own reasons for providing alternate numbers, 
but those reasons were not obvious from a mere examina- 
tion of titles. The variations for these titles occurred in 
the alphabetical, numerical, or Cutter parts of the class 
numbers. One example is Solo Song, 1580-1730 (1973; 
25 percent, 1.35), which had as its SCN M1619 M17 even 
though there was no instruction to Cutter by author in the 
M schedule. It had, however, 75 percent matches across 
library systems. Non-matching library systems assigned 
just M1619 or used a Cutter number variation. The CIP 



record used the SCN, which may have served as a source 
for class number. 

Cuttering was used by a majority of libraries in three 
cases to introduce subtopics not enumerated in the sched- 
ules. They were Davis's From the Dark Tower: The Afro- 
American Writers, 1900-1960 (1974; 4.3 percent, 2.35), 
with the SCN PS153 N5, where N5 points to Afro- American 
writers; Mencius (1963; 22.9 percent, 1.30) with the SCN 
BR128 M33, where M33 refers to Mencius; and Platnauer's 
Latin Elegiac Verse (1971; 4.8 percent, 2.36), with the SCN 
PA2335.E5, where E5 refers to elegiac verse. In these 3 
cases, the non-matching variants differed only in the Cutter 
part of the number. 

In some cases, the nonconforming libraries seem to 
have chosen a class number that was broader in scope than 
the specific enumerated number. For example, there was 
a specific class number for Leibniz's Philosophy of Logic 
and Language (1972;2.1 percent, 1.38), yet 1 library chose 
a broader number. In this miscellaneous group, there 
were also 2 titles illustrating that it is possible for editions 
of a title not to get the same class number. In the case of 
Kochan's Jews in Soviet Russia since 1917 (1970; 9.1 per- 
cent, 3.28), its SCN DS135.R9 is an enumerated number. 
DS135 in the DS schedule (1959) refers to Jews outside 
Palestine, and the Cutter R9 refers to Russia, general 
works, 1917-. The variant was DS135.R92, an enumer- 
ated number in a later edition of the DS schedule (1987). 
Exemplifying the same situation was the title Muslim 
Creed by Wensinck (1938; 18.2 percent, 1.31). Its SCN 
was BP161 (General works on Islam, 1801-1950), the class 
number also assigned by the Library of Congress to this 
particular edition. Two library systems assigned BP166.1 
(Islam, History of Theology, General works). A check of 
the BP schedules in use at the time revealed that BP166.1 
was not listed in the appropriate edition (1950). However, 
during the analysis of editions of this title, it was found that 
another edition, published in 1968, was given BP166.1 by 
the Library of Congress, which most libraries also used. 
The use of BP166.1 for the 1938 edition probably can be 
attributed to reclassification of the older edition to match 
the class number of the recent edition by the two library 
systems in question, while others chose to maintain the 
old number for the earlier edition and the new one for 
the current edition. In this case the difference between 
the variant and the SCN was in the numerical part of the 
class number. There are times when a number has been 
discarded in the LC schedules, but some libraries contin- 
ued to use the old number in favor of the new. Two titles 
represented this case. An example is Davis's Systematic 
Embryology of the Angiosperms (1966; 26.3 percent, 5.37), 
for which the SCN was QK 643 A5, while the variant was 
the discarded number QK 693, which also happened to be 
the Library of Congress-assigned number. 



50(2) LRTS 



Library of Congress Classification Numbers 1 1 7 



Sometimes libraries chose a specific number even 
though there was an explicit "see" reference under that 
number directing the use of another number. In the case of 
Franks Historic Pottery of the Pueblos, 1600-1880 (1974; 
2.3 percent, 3.21), a library chose NK4017 (Ceramics) even 
though there was an instruction "for American Indians 
use E99" underneath that entry. In this case the SCN and 
the sole non-matching number were completely different. 
Sometimes, non-matches appear to be wrong. For Walker's 
The Decline of Hell (1965; 5.1 percent, 1.33), the SCN was 
BT836.2 (Doctrinal theology, General works, 1951-) but 2 
library systems chose BT836 (Doctrinal theology, General 
works, 1801-1950). So also in the case of Born's Principles 
of Optics (1980; 9.5 percent, 5.40), the SCN and die vari- 
ant class numbers were respectively QC355.2 (Optics, 
general works, 1970-) and QC355 (Optics, General works, 
1801-1969). 

Group IV: Titles That Received Both Alternate 
and Variant Class Numbers 

Eight titles displayed a combination of sources for variation. 
Their non-matches ranged from 4.3 percent to 54.3 percent. 
The following are two examples. 

• Borde's Sir John Beverly Robinson: Bone and Sinew 
of the Compact (1984; 54.3 percent, 4.23), with 
54.3 percent non-matches, had two subject head- 
ings: Bobinson, John Beverly and Judges — Ontario- 
Biography. Its SCN was KE 406 B63 (Law of Canada, 
Biography). Its variant was KE406 B62, a differ- 
ent Cutter. Its alternates were F1058 B6 (Histoiy, 
Ontario, Biographies) and KE8248 B66 and KE8248 
B6, where KE8248 (Supreme Court of Canada, 
Judges, Biography) represents the number for second 
subject heading, 

• Victorian Science and Victorian Values: Literary 
Perspectives (1981; 33.3 percent, 5.33). This is a title 
in the series Annals of New Academy of Sciences. Its 
SCN was Qll (Science, Societies, United States), 
corresponding to series. Its alternates were Q175.52 
(Science, Philosophy) and PB468.S34 (History of 
English Literature, special topics), reflecting its two 
subject headings. 

Conclusion 

The finding that a title is likely to have the same class 
number in more than 85 percent of the library systems 
holding it was surprising. While one expected that the LCC 
scheme, being an enumerative classification scheme, would 
yield considerable consistency among assigned class num- 



bers, such high degree was nevertheless impressive given 
the study's stringent condition imposed for matching class 
numbers; namely, for a match to occur all the class numbers 
for a title in a library system had to be the same. The ques- 
tion that then arises is how best to bring die remaining 15 
percent of nonconforming library systems into the fold, at 
least online, in order to make developing a universal search 
language that can automatically search all catalogs using one 
class number for a title feasible. 

The sample included titles that must have been cata- 
loged in the precooperative cataloging era. The increasing 
trend of accepting call numbers as is from the copy catalog- 
ing record for local use by libraries will enhance the level of 
consistency in class number for a title across library systems. 
Moreover, the Library of Congress is replacing ranges with 
enumerated class numbers in the online version of die 
scheme as recommended by Williamson, leading to increas- 
ing consistency in the Cutter number part of the class num- 
ber. 15 Thus copy cataloging and replacement of ranges widi 
enumerated numbers will help to reduce the size of variants 
discussed under group II, III, and IV titles in new records. 

Another equally important finding of this study is that 
a fair number of titles were correctly assigned alternate 
class numbers. These include those with multiple foci, 
titles in series, bibliographies, memoirs, titles by colonial 
authors, English fiction titles, and works that belong to 
more than one form of expression in particular, discussed 
under group I titles. An individual library may like to 
continue assigning alternate class numbers in order to 
integrate older materials with new ones on the same topic 
locally. However, such a library will encounter problems if 
it has a blanket policy of accepting class numbers found in 
the copy cataloging record and the copy cataloging record 
has no alternate numbers. For example, the Library of 
Congress provides both Qll and PB468 in its catalog 
record for the 1981 edition of Victorian Science and 
Victorian Values in the sample, but its record for the 1985 
edition has only PB468. Eight library systems that had 
the 1981 edition under Qll used PB468 for their 1985 
volume. The dispersion of even successive editions of the 
title by class number violates the fundamental principle 
of classification, namely collocation, and thus is troubling. 
Ideally, cataloging copy would provide both standard and 
alternate class numbers for those materials, and online 
systems would find a way to link those class numbers. 

In the card catalog era, when class numbers served 
primarily as location devices, libraries were forced to choose 
only 1 class number. Fortunately, online systems have the 
potential to handle multiple access points, including mul- 
tiple class numbers. Internet resource databases provide 
access to thorough multiple class numbers for the same 
source, as has been recommended in the Final Report 
of the Subcommittee on Metadata and Subject Analysis 
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of the Subject Analysis Committee in the Association for 
Library Collections & Technical Services Cataloging and 
Classification Section. 16 It would be useful to exploit this 
potential by providing at least all LCC call numbers in the 
MARC record for a title for the following reasons: 

• First, a preferred call number, together with its alter- 
nates, more fully represents a title than just the pre- 
ferred call number alone. Just as a MARC record for 
a title carries series information and multiple LCSH 
describing its contents fully, so also it might carry 
assigned multiple call numbers in order to increase 
the availability of subject information through call 
numbers. Such a record can be easily integrated with 
digital resource records with multiple class numbers 
as access points. 

• Second, thanks to the activation of many types of 
search keys — such as publication year — in addition to 
traditional ones — such as author — a user can browse 
the collections in a library through many types of vir- 
tual displays with the stroke of a key. For instance, a 
multi-foci title can be assigned multiple call numbers, 
one of which can be designated as a number for fixing 
the physical location of that title in a library, but all of 
which can be searched online. Liberating a library's 
collection from the constraints of linear displays 
on shelves and in the card catalog would enhance 
retrieval by providing the user with multiple subject 
displays. Assignment of multiple class numbers for 
digital resources is now a reality. Multiple class num- 
bers at the searching phase for nondigital resources 
will enable better integration of digital and nondigital 
resources online. 

• Third, an exhaustive listing of class numbers for a 
title — and, by extension, for related titles — would 
give data to researchers interested in subject analy- 
sis. A cluster of call numbers connected by see also 
entries, similar to clusters of subject headings in 
authority lists of subject headings, leading from one 
class number to others connected with it in some way, 
can be a source of valuable information for research- 
ers interested in subject structures. 

In short, where deviations from a common number are 
warranted, libraries could assign two class numbers — the 
Library of Congress-assigned class or call number(s) for 
searching and a local class or call number as a shelf col- 
location device. It would not be difficult for a library, while 
downloading a record from OCLC, to keep die Library 
of Congress-assigned class number associated with that 
record for searching, even if a decision is made to use 
another number for the shelf listing. The precedence for 
such a practice already exists. OCLC member libraries 



can generate subject headings for local use in generating 
entries for their online catalogs. Enabling class number 
searching in sophisticated systems and devising ways to 
reduce inconsistency will give users enhanced and more 
precise access to information. 
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Appendix: Library Systems Sampled 



Arizona State University 
Boston University 
Brigham Young University 
Brown University 
Columbia University 
Cornell University 
Dartmouth College 
Emory University 
Florida State University 
Harvard University 
Indiana University 
Johns Hopkins University 
Kent State University 
Michigan State University 
New York University 
North Carolina State University 
Pennsylvania State University 
Bice University 



Butgers, the State University of New 
Jersey 

Stanford University 

State University of New York, Stony 
Brook 

Texas A & M University 
University of California, Berkeley 
University of California, Davis 
University of California, Irvine 
University of California, Los Angeles 
University of California, Biverside 
University of California, Santa Barbara 
University of California, San Diego 
University of Delaware 
University of Georgia 
University of Hawaii 
University of Houston 
University of Illinois, Chicago 



University of Iowa 
University of Kentucky 
University of Maryland 
University of Massachusetts 
University of Michigan 
University of Nebraska 
University of North Carolina 
University of Notre Dame 
University of Oklahoma 
University of Oregon 
University of Pennsylvania 
University of Pittsburgh 
University of Utah 
University of Wisconsin-Madison 
Vanderbilt University 
Washington State University 
Washington University, St. Louis 
Wayne State University 



Index to Advertisers 

Library of Congress cover 2 

LITA 136 

Archival Products 148 

Library Technologies cover 3 
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Notes on Operations 

Utilizing Z39.50 to Obtain 
Bibliographic Copy 

A Cost-containment Study 

Christine DeZelar-Tiedman, Cecilia Genereux, 
and Stephen Hearn 

This paper looks at one approach to controlling costs when seeking cataloging 
copy. A small task group in the University of Minnesota Libraries Technical 
Services Department conducted a study to devise the most cost-effective strat- 
egy for searching for and importing bibliographic copy, by compiling costs and 
benefits of importing records from the OCLC Online Compu ter Library Center 
database, the Research Libraries Group Union Catalog (RLIN), and the Library 
of Congress (LC) catalog. Results of the study indicated that, although the LC 
database is smaller than the other two utilities, a sufficient portion of needed 
records were available from LC to more than offset the cost of re-searching in the 
other databases for records not found. In addition, due to differences in pricing 
structure, searching RLIN second was found to be more cost effective than going 
next to OCLC, even though a slightly larger proportion of items were found in 
OCLC than RLIN. This study may prove useful either as a research method 
or in terms of its findings for other libraries wishing to compare sources of 
cataloging copy. 
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Many libraries and their technical 
services units face similar chal- 
lenges, including steady state or declin- 
ing operating budgets, implementation 
of new integrated library systems (or 
version upgrades), and continuing 
interest in improving processes to gain 
efficiencies. One goal in cataloging 
units often is to locate and use as many 
copy cataloging records as possible in 
order to speed movement of items to 
the collection and reduce die number 
of items that require original catalog- 
ing. This paper reports the results of 
a research project at the University of 
Minnesota Libraries (UML) Technical 
Services Department that explored 
costs and workflow issues related to 
obtaining bibliographic copy from 
the OCLC Online Computer Library 
Center (OCLC) database, die Research 
Libraries Group (RLG) Union Catalog 
(RLIN), and the Library of Congress 
(LC) catalog. 

The authors describe the record 
import process and workflow prior 



to the project, present the questions 
that the project sought to answer, and 
report the research method and find- 
ings. A literature review places this 
project within the context of other 
cost studies. Implemented changes 
and die resulting financial impact are 
reported. Finally, the authors draw 
conclusions for other libraries consid- 
ering a similar cost-benefit analysis. 

Background 

In spring 2003, a small group, the 
Source of Records Task Group (SRTG), 
was formed at UML to explore issues 
concerning cost and workflow for 
obtaining bibliographic copy from 
various sources. UMLs technical ser- 
vices department was in the midst of 
revisiting and redefining workflow as 
a result of a July 2002 conversion from 
the NOTIS system to the ExLibris 
Aleph500 integrated library system, 
and the department was also seeking to 
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reduce expenditures as part of a librar- 
ies-wide budget retrenchment. 

UML has a longstanding policy of 
cataloging in the local system rather 
than directly in the OCLC database 
or RLIN and of using batch loading to 
contribute to the utilities. RLIN (now 
RLIN21) properly refers to the set of 
database services provided by RLG. 
In this paper, following common usage 
in the profession, RLIN refers more 
narrowly to the RLG Union Catalog. 
Because of the variety and breadth of 
UMLs collections, the libraries have 
found using and contributing to both 
services beneficial. At the time of the 
system conversion in 2002, system 
implementers decided to discontinue 
using OCLC Passport and RLIN for 
Windows client interfaces (hereaf- 
ter referred to as native clients) for 
the purpose of searching and down- 
loading records into the local system. 
Instead, implementers explored the 
option of importing records through 
Z39.50, a protocol that allows a cli- 
ent to search multiple remote servers 
using a single search interface. One 
reason for this was a desire to provide 
staff members with a uniform inter- 
face for bibliographic searching so that 
they need only be trained in a single 
set of commands and search string 
formulation criteria. Additionally, both 
RLG and OCLC were in the midst 
of, or soon to be embarking upon, 
significant redesigns of their own 
search interfaces (RLIN21 and OCLC 
Connexion, respectively). Asking staff 
to learn three new search interfaces 
in rapid succession seemed like too 
much change in too short a time. The 
RLIN for Windows (later RLIN21) 
client would continue to be used for 
some specialized functions, such as 
East Asian vernacular cataloging and 
Name Authority Cooperative Project 
(NACO) authority work. 

With these factors in mind, the 
local Aleph system was configured to 
import bibliographic copy from OCLC 
and RLIN using Aleph 's Z39.50 inter- 
face. A basic outline of the workflow 



for this procedure follows, based on 
Aleph500 version 14.2. In this out- 
line, user refers to the staff member 
performing the searching and copy 
cataloging functions. 

• Upon logging in to Aleph's 
searching module, the user is 
automatically connected to the 
local catalog; a menu is avail- 
able to allow the user to con- 
nect to other databases. The 
user selects an external data- 
base (such as RLIN or OCLC), 
which is accessed using the 
Z39.50 protocol. 

• The user executes a search, 
using the same types of com- 
mands available to perform a 
staff search of the local catalog. 
While the choice of indexes is 
more limited than what is avail- 
able for the local database, and 
slight differences in formulating 
searches exist in some cases, 
the process is relatively uniform 
across databases. 

• If appropriate bibliographic 
copy is found, the user selects 
the record and moves it to the 
cataloging module. Users also 
may select several records from 
a single retrieval set and move 
them to the cataloging module 
in a batch. 

• In the cataloging module, the 
user executes the duplicate 
command, which creates a 
working copy of the record on 
the user's hard drive. The user 
is prompted to select a format 
type (such as books, serials), 
and then is able to edit the 
record as needed, depending 
on the level of the staff member 
and the stage in the workflow. 

• Once a record has been saved 
to the server, associated orders, 
holdings, and item records are 
created as needed. 

Figure 1 presents a workflow 
chart of these procedures. 



Some significant challenges arose 
in implementing this new process. 
First, while the Aleph system theo- 
retically allowed for this method of 
obtaining bibliographic copy, no other 
North American library had yet imple- 
mented it. Therefore, UML systems 
staff had to work veiy closely with 
ExLibris programmers, as well as with 
staff at OCLC and RLG, to develop a 
workable configuration. 

The initial Z39.50 configuration 
was confined to a few indexes, with 
little or no ability to limit or sort 
search results. This led to some frus- 
tration among staff members, par- 
ticularly those who were searching for 
materials lacking standard numbers, 
such as International Standard Book 
Numbers, International Standard 
Serial Numbers, or Library of Congress 
Control Numbers, or for materials with 
common one-word titles, frequently 
found with serials. In this period of 
transition, implementers decided not 
to immediately remove staff access to 
the native clients while encouraging 
the transition to Z39.50. However, 
technical and anecdotal evidence indi- 
cated that many staff were searching 
the native client to obtain an RLIN or 
OCLC number, and then going back 
to die Z39.50 interface to retrieve the 
record or, because of unfamiliarity 
with the new interface and mistrust of 
their initial search results, were follow- 
ing up the Z39.50 search with another 
search in the native client to make 
certain they had retrieved a complete 
result set. The per search cost for 
Z39.50 searches in OCLC is higher 
than for RLIN Z39.50 searches or for 
native client searches in either utility. 
Consequently, staff were duplicating 
searches, resulting in double charges 
and increasing the bibliographic utility 
bills, particularly for OCLC. 

Problem Statement 

In an environment where cost cut- 
ting was needed, the trend of rising 
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Log in to Aleph 
14.2 GUI OPAC 
(search module) 



Open Z39.50 access 
to remote source 



Search for catalog record 
in source 



1 


r 


Display full 




remote source record 



"Duplicate" record creates 
working copy in user's 
Aleph client 



Select record format type 
(e.g., books, serials) 



Edit cataloging copy 
as needed 



Save record to server; 
create orders, holdings, 
and item records as 
needed 



Figure 1. Workflow chart for importing bibliographic records in Aleph500 14.2 
through Z39.50 



bibliographic utility costs was a cause 
for concern. SRTG sought ways to dra- 
matically cut searching costs. In order 
to achieve this, the task group used 
a two-pronged approach. The first 
recommendation, now successfully 
implemented, is not the focus of this 
study. In brief, it recommended that 
additional search indexes and search 
limits be made available for Z39.50 
searching, and that in-depth training 
on searching each of the databases be 
provided to all technical services staff. 
This helped staff gain confidence that 
the Aleph client's Z39.50 search results 
are complete and accurate. 

The second recommendation was 
to perform a study to explore whether 
providing Z39 .50 access to a third biblio- 
graphic database, LC's online catalog, 
would significantly reduce searching 
and retrieval costs. Access to the LC 
database and the import of MARC 
records is available free of charge. 
However, it is a smaller database than 
either OCLC or RLIN. Would the 



cost savings accrued by obtaining free 
records from LC be offset by the 
additional staff time spent searching in 
other databases when records are not 
found in LC? Or is it more efficient 
to search the larger databases first, 
accruing a higher search and down- 
load charge, but using less staff time? 
These factors had to be weighed in the 
task group's decision-making. 

The study sought to answer the 
following questions: 

• Overall, which is the most 
cost-effective order in which 
to search for materials through 
Z39.50 in the LC, RLIN, and 
OCLC databases? 

• What are the relative levels 
of cost effectiveness in rela- 
tion to searching and import- 
ing records from LC, RLIN, 
and OCLC through Z39.50 
as compared to searching and 
importing through the OCLC 
and RLIN native clients? 



• Based on such criteria as for- 
mat, place of publication, and 
year of publication, are par- 
ticular types of resources more 
likely to be found in specific 
databases, thereby precluding 
the previous searching order for 
these resources? 

The data developed for the study 
and discussed in the following were 
able to provide working answers to 
the first two questions. Additional 
research will be needed to answer 
the third. However, because UML 
has chosen to maintain its policy of 
using the Aleph client for searching 
all three sources, no practical test or 
confirmation of predicted results can 
be reported for the use of OCLC 
and RLIN native clients with their 
more complex cost factors. Therefore, 
though the research method needed 
for exploring the second question is 
discussed, only the results relevant for 
searching with Aleph's Z39.50 client 
are reported here. 

Literature Review 

A survey of literature on the topic of 
cost studies, hit rates, Z39.50 search- 
ing, and uhe cost of obtaining catalog- 
ing copy yielded a number of articles, 
but few pertaining directly to the UML 
study. Several articles have been writ- 
ten about cost studies conducted by 
individual institutions. One of the best 
documented is Iowa State University's 
longitudinal cost and time study in 
technical services. Morris; Osmus and 
Morris; Morris, Rebarcak, and Rowley; 
Morris et al.; and Fowler and Arcand 
detail the Iowa findings at various 
points and relating to different catego- 
ries of technical services work consid- 
ered during the study. 1 The Iowa study 
breaks down technical services activi- 
ties into product centers (products and 
services) and overhead centers (leaves, 
administrative and professional activi- 
ties, meetings, and so on). Overhead 
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costs are calculated and then allocated 
to product centers, providing a look 
at the true cost of technical services 
activities. Product centers are sepa- 
rated into the five activities of acquisi- 
tions, cataloging, catalog maintenance, 
volume preparation, and preservation. 
The cataloging product center includes 
searching for cataloging copy, which is 
done primarily by cataloging staff. The 
Iowa reports indicate the number and 
percentages of staff hours spent on 
searching in the aggregate, but they do 
not provide detailed information about 
per-search costs. 

Two of the Iowa study reports 
mention the percentage of biblio- 
graphic records found in OCLC, and 
one categorizes the type of cataloging 
records found. In the first article about 
the Iowa study, Morris writes that, 
"ISUL [Iowa State University Library] 
catalogs more than 90 percent of all 
new monographic titles at receipt with 
copy found in the OCLC database." 2 
In a later report Morris, Rebarcak, and 
Rowley indicate that, "over 90% of 
the monographs ordered in 1994-95 
had an OCLC cataloging record at the 
point of pre-order search." 3 Providing 
the only information on die type of cat- 
aloging records obtained from OCLC 
and the PromptCat Service, Morris 
et al. report that in 1997-98, in com- 
parison with 1990-91 data, LC copy 
rose from 46 to 59 percent, member 
copy rose from 27 to 40 percent, CIP 
declined from 27 to 1 percent, and 
6,325 LC records were added through 
the PromptCat Service. 4 

Research and reports on hit 
rates in bibliographic databases have 
been published since die late 1970s. 
However, the authors were only able 
to find three studies comparing hit 
rates between RLIN and OCLC. In 
the 1990 article, "Chasing MARC: 
Searching in Bibliofile, Dialog, 
OCLC, and RLIN," Allan compared 
hit rates in Bibliofile, Dialog, OCLC, 
and RLIN by searching a sample of 
1,000 English and foreign language 
monographs in April 1987.° For items 



with no exact match, a second search 
was conducted one year later. The 
first round of searching revealed that 
cataloging copy was found in OCLC 
for 86.3 percent of the titles and for 
76.6 percent in RLIN. After the sec- 
ond round of searching a year later, 
the percentages of matches increased 
to 91.2 percent in OCLC and to 85.9 
percent in RLIN. 6 

The second article comparing 
RLIN and OCLC is "Cooperative 
Cataloging of Latin-American Books: 
The Unfulfilled Promise," in which 
Grover examined hit rates in the two 
databases and whether the cataloging 
copy found was LC- or member-pro- 
vided.' Grover used a sample of 298 
humanities and social science mono- 
graphs from Latin American countries 
from three different libraries, with 
the majority coming from a medium- 
sized academic library. Each item was 
checked at six-month intervals between 
November 1983 and May 1985. Grover 
concludes that, "There were no impor- 
tant differences between OCLC and 
RLIN, and in the end, both systems 
had cataloged almost the same num- 
ber, although not the same books." 8 
Grover also notes that almost 50 per- 
cent of the books in both systems were 
first cataloged by LC. 9 

In a report of a study examin- 
ing the availability of bibliographic 
records in RLIN and OCLC for 
Spanish- and Portuguese-language 
monographs, Erbolato-Ramsey and 
Grover find that the two bibliographic 
databases are comparable. 10 Using a 
sample of books that ranged from just 
receipted to 46 months in a cataloging 
backlog, searches were conducted first 
in RLIN. If no cataloging copy was 
found, than the title was searched in 
OCLC. Hit rates were broken down 
into percentages of titles found in 
RLIN, on both RLIN and OCLC, and 
no copy found. The authors found that, 
"By the end of the sixteenth month, 
the percentages were 84% on RLIN, 
91% on RLIN and OCLC combined, 
and 9% for items not found in either 



system." 11 The authors also found that 
LC cataloging records were available 
for 53 percent of the titles in the first 
six months, peaking at 65 percent at 
twelve months. 12 

Little in the literature examines 
the benefits of obtaining cataloging 
records from national libraries. Only the 
recent article by Beheshti, Large, and 
Riva, reporting on the cost savings to 
Canadian universities and large urban 
public libraries by using MARC records 
provided by the National Library of 
Canada (NLC), relates closely to the 
UML findings. 13 The Canadian study 
found drat, "The average annual cost 
saving for a university library when 
using NLC MARC records for derived 
cataloging for Canadian monographs 
and federal government documents 
is $16,400, while the average savings 
for a large urban public library is 
$7,800." 14 Not only do libraries save 
on the cost of original cataloging, but 
Beheshti, Large, and Riva also report 
that most libraries indicated drat they 
obtain the NLC records through a free 
source. Of the sources mentioned by 
libraries, NLC's online catalog, Amicus 
Online, was the "most frequently cited 
single source," and "Web OPACs 
including Z39.50 servers are used by 
76% of libraries." 15 

Few articles about the use 
of Z39.50 by technical services to 
search for bibliographic records have 
appeared in the library literature. 
Reporting on die benefits LC derives 
from the use of BIBCO records (bib- 
liographic records contributed to the 
LC catalog by libraries participating in 
the Bibliographic Record Cooperative 
Program), Wiggins mentions that the 
records are searched for and obtained 
through Z39.50 from OCLC and the 
RLG union catalog. 16 Wiggins also 
notes that cataloging teams will search 
for BIBCO or other source records 
again upon receipt of items, "follow- 
ing their hunches about titles that are 
likely to be represented in OCLC or 
the RLG union catalog." 1 ' 

The first priority of the UML 
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study was to determine the most 
cost-efficient sequence for searching 
OCLC, RLIN, and the LC catalog for 
typical records using Z39.50. Along 
with the most cost-efficient search 
order, the study sought to suggest 
preferred sources of more special- 
ized copy because, just like LC staff, 
staff at UML often use their intuition 
about where copy is likely to be found. 
If preferred record sources could be 
determined for specific types of mate- 
rials, then staff could avoid unneces- 
sary searching. The two approaches 
work to address both the efficiency 
and effectiveness of searching. This 
latter question was not adequately 
answered by die task group's sample 
data, so UML currently follows LC in 
relying on searchers' hunches when 
searching for specialized copy. 

Method 

Sampling Searching Hit Rates 

To get an indication of die best source 
of records for different types of mate- 
rials, SRTG developed an interactive 
project database to track hit rates for 
a sample of materials that staff might 
search in their daily work. Ten staff 
members were asked to keep track of 
their searches during a typical work- 
week, gathering statistics on what level 
of copy was found in the LC database, 
OCLC, and RLIN. The group defined 
typical to mean a week when the vari- 
ety of items searched was not affected 
by such exceptional circumstances as a 
special ordering, a cataloging project 
focused on a particular type of materi- 
al, system downtimes, or other factors. 
The task group relied on the judgment 
of staff doing the sampling that the 
week reviewed was not unusual. The 
project database allowed staff to indi- 
cate whether full copy, minimal copy, 
or no copy was found in each of the 
three databases, as well as the format, 
language, place and date of publica- 
tion, and broad subject area of the 
materials they were searching. 



To achieve a broad cross-section of 
searching, the task group deliberately 
chose staff who work with a variety 
of materials. As student workers per- 
form initial searching in some units, 
student supervisors were recruited 
for the project as well. However, a 
simplified definition of full copy for 
students gathering statistics was pro- 
vided (see appendix), as they could 
not be expected to make the more 
refined judgments that a permanent 
staff member would be likely to make 
regarding level of copy. 

Staff were instructed to search the 
LC database first to locate available 
copy. Searchers recorded whether 
they found no copy, minimal copy 
needing additional work, or full copy. 
The task group provided detailed defi- 
nitions for these terms, as defined in 
the appendix. 

The searchers then looked for 
each title in OCLC and RLIN, record- 
ing the results. Although in an actual 
workflow staff would stop searching 
once a full record is found (or, if the 
search was being performed at time 
of order, when any level of copy is 
found), searchers were asked to per- 
form each search in all three databases 
to make full comparisons of the hit 
rates per database. 

Per-ltem Cost Estimates 

The task group also was aware of the 
need to consider staff time and the 
cost of trying more than one search in 
a source and of repeating searches in 
multiple sources as factors in deter- 
mining preferred sources for materi- 
als. To address these cost components, 
sample data was gathered for the time 
required to search and download a 
record from each source. Estimates 
were made of the proportion of 
instances in which a single search 
would not suffice and two or three 
searches in a single source would be 
required. Lastly, the costs charged 
per search by each source were 
determined. For Z39.50 searches, 



these per-search charges were fairly 
straightforward for UML in the year 
of the study: $1.00 for OCLC, $0.59 
for RLIN, and $0.00 for LC. More 
involved calculations were required 
for estimates of the cost of native 
client interface searches done in 
OCLC and RLIN, as standard num- 
ber searches are charged differently 
from other kinds of searches. Details 
of these cost calculations can be found 
in the final report of the task group. 18 
These data and estimates were 
then combined into formulae to 
express the cost per item for search- 
ing in each of the three sources. For 
each source, the cost of each type of 
search and the staff time required for 
each search were multiplied by the 
estimated frequency of that search 
type in the overall searching pro- 
cess. The cost of the total number of 
estimated searches was then divided 
by the number of searches to pro- 
duce a figure for the average cost of 
searching each source using the Aleph 
Z39.50 client or using the native cli- 
ent interfaces (OCLC Passport and 
RLIN Terminal for Windows). The 
latter figures are included in the task 
group report. However, as both util- 
ities have since converted to new 
native interfaces (OCLC Connexion 
and RLIN21), these calculations will 
not be reported here. 

Per-Search Cost Averages 

By combining these per-item costs 
with the results of the study sample, 
the task group was able to determine 
the comparative costs of searching all 
items in one source, then all unfound 
items in the next source, and all 
remaining unfound items in the third 
source. Because LC's full records are 
distributed to both OCLC and RLIN, 
a sequence calling for searching LC 
after searching either OCLC or RLIN 
was considered superfluous. Given 
that each sequence was known to 
produce the same number of total 
records found, the task group could 
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Table 1 . Hit rates by utility and level of copy found 



Database 



Full copy 



Copy needs work 



No copy 



LC 

OCLC 
RLIN 
Combined 



No. 

168 
246 
224 
340 



% 

39 
57 
52 
78 



No. 

52 
107 
96 
54 



% 

12 
24 
22 
13 



No. 

213 
SO 

113 
39 



% 

49 
19 
26 
9 



total foi UML 




OCLC 



RLIN 




Figure 2. Hit rates by utility and level of copy found 



then calculate the most cost-effective 
sequence of sources for searching 
the sample set of records. This was 
the primary goal of the study — to 
recommend a preferred sequence for 
searching these three sources in terms 
of cost effectiveness, not to measure 
the coverage of the sources as com- 
pared to each other. 

Results 

Sampling Searching Hit Rates 

Data was submitted for a total of 
433 items searched, of which 300 
were books and 133 were nonbook 
formats. While the data gathering pro- 
cess allowed results measurement by 
format, language, publication place 
and date, and subject, the complexity 
of the results and the relatively small 
sample size for particular characteris- 
tics make reliably predicting the best 
source of copy based on each of these 
separate criteria difficult. However, 
some generalizations can be made 
from the data. 

Full copy was found in the LC 
database for 39 percent (168) of the 
433 titles searched, as compared to 
57 percent (246) in OCLC and 52 
percent (224) in RLIN. When positive 
hits for either full or minimal copy are 
combined for each source, the results 
were 51 percent (220), 81 percent 
(353), and 74 percent (320), respec- 
tively. Because OCLC and RLIN both 
contain overlapping but different sets 
of full and minimal records not avail- 
able in LC, UMLs overall hit rate using 
all three sources was 78 percent (340) 
for full records and 91 percent (394) 
for full plus minimal records. Table 1 
and figure 2 illustrate the hit rates for 
die sample set searched in each of the 
diree databases and the overall hit rate 
for the three databases combined. 

These numbers suggest that a 
sufficient portion of copy is available 
free of charge from LC to justify using 
this as the first source of copy. This 
assumption will be furdier explored in 



the next section, which also considers 
time spent by staff in searching mul- 
tiple databases. 

To define more precisely the best 
source of copy for materials based 
such specific criteria as language or 
format, the task group would need 
to collect additional data to build 
up a more reliable sample size. For 
example, the data gathered indicates 
that the LC database may be a poor 
source of records for music scores, 
but a good source for maps. However, 
due to the small number of data 
points gathered, the task group can- 
not conclude this reliably. 



Per-ltem Cost Estimates 

The per-item searching transaction 
costs represent an average of vari- 
ous kinds of search outcomes, based 
on the working assumption that in 
10 searches: 

• 5 will end after 1 standard num- 
ber search; 

• 3 will end after 1 standard num- 
ber and 1 additional author/title 
search; and 

• 2 will end after 1 standard num- 
ber and 2 additional author/title 
searches. 
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These proportions were based on 
limited sampling, and would need 
to be modified if a particular kind 
of material, such as older materi- 
als lacking standard numbers, were 
expected to be a significant part of 
the workflow These proportions are 
also an acknowledged but necessary 
simplification of the options encoun- 
tered in the searching process. Such 
simplifications were considered nec- 
essary to provide a basis for estimating 
the number of searches generated by 
a given number of items so that that 
could become a factor in the per-item 
cost of searching. 

In calculating staff costs, the task 
group assumed: 

• The average time for each search 
transaction is one minute from 
the point a search is entered 
until a record is selected and 
displayed for review. The aver- 
age transaction time for copying 
a record to the Aleph server was 
twelve seconds. Based on the 
limited sampling done, these 
time figures were true for all 
three record sources. 

• The average cost of staff time 
spent on obtaining a record is 
estimated at $15.00 per hour, or 
$0.25 per minute. 

• Multiple searches incur a higher 
per-item cost; hence the approxi- 
mation of the number of search- 
es needed per item discussed 
previously must be a factor in 
calculating staff time as well. 

• Occasional exceptional delays 
in response time from Aleph 
or the source systems were 
not considered. None of the 
sources has been found to be 
delay-prone. 

The task group formulated the 
study's derived per-item cost figures 
for searching OCLC, RLIN, and LC 
by combining figures for the three 
sources' charges for different kinds 
of searches, the estimate of the vary- 



ing number of each kind of search 
required, and the staff cost com- 
ponent of each search, and then 
dividing by the number of searches. 
The per-item cost factors for using 
Aleph's Z39.50 client are shown in 
table 2. 

Per-Search Cost Averages 

The per-item costs were then com- 
bined with the sample set data to 
yield average costs per search when 
different assumptions are applied 
about the order in which the three 
sources are searched. Because each 
item had been searched in all three 
sources, the task group could deter- 
mine for each sequence, beginning 
with LC, how many records would be 
found in the first source, in the sec- 
ond source, and in the third source. 
Searches beginning with OCLC or 
RLIN were assumed to include all 
the records available from LC, mak- 
ing a separate search of the LC file 
superfluous. 

The average per record costs for 
each of the four search sequences are 
found in table 3. 

Based on the study's data and 
estimates, the fourth sequence, start- 
ing with LC, looking next at RLIN, 
and last at OCLC, was found to be 
the most cost effective by a small 
margin. A much clearer margin sepa- 
rated the average costs of searching 
OCLC or RLIN as the first source 
from either of the sequences begin- 
ning with LC. The goal of the study 
was to recommend to staff a pre- 
ferred searching sequence for the 
three available sources. The data 
clearly supported recommending 
that LC's database be searched first. 
Though the preference of RLIN over 
OCLC is not as well-supported by the 
small margin of difference between 
them in the study, the need to give 
staff clear direction weighed in favor 
of recommending the LC/RLIN/ 
OCLC order. 



Table 2. Per-item cost figures for searching 
bibliographic databases using Z39.50 



Database 


Cost per item ($) 


OCLC 


1.825 


RLIN 


1.428 


LC 


0.425 



Note: The formula used for these and other cost 
per-item estimates in the original report is: .5 
(one standard number search cost + staff cost 
for 1 search) + .3 (standard number search cost 
+ title search cost + staff cost for 2 searches) + 
.2 (standard number search cost + 2 title search 
costs + staff cost for 3 searches) = average cost 
per item searched. 



Table 3. Average 


per record cost by 


search sequence 




Search sequence 


Cost per record ($) 


OCLC/RLIN 


3.46 


RLIN/OCLC 


3.16 


LC/OCLC/RLIN 


2.57 


LC/RLIN/OCLC 


2.53 



Discussion 

The advantages of searching LC first 
as a source of records are clear in the 
study's findings, despite the fact that 
LC's database is smaller than either 
OCLC or RLIN. If the proportion of 
records found in LC's database had 
been significantly smaller, this would 
not have been the case. The staff cost 
of fruitless searching would have out- 
weighed the benefits of finding a few 
records at lower expense. However, 
the study's sample data indicate that 
LC is actually a good source for a 
substantial portion of the items for 
which UML needs records. Therefore, 
the cost savings realized by conduct- 
ing an initial search in LC more than 
offset the costs of failed searches and 
of repeating searches in OCLC or 
RLIN for unfound items. This result 
may seem counterintuitive, as it leads 
to more failed searches and more 
staff time expended than would be 
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required if the richest source, OCLC, 
were searched first. However, the costs 
of searches in the two utilities, espe- 
cially of Z39.50 searches in OCLC, 
are so high that the savings realized 
by searching LC first and reducing the 
number of OCLC searches required 
more than covers the cost of the addi- 
tional staff time spent. 

Similarly, RLIN was recom- 
mended as a preferred second source, 
despite having an overall lower hit 
rate tiian OCLC for the sample set. 
The significant difference in the per- 
search charges for Z39.50 searches 
set by the two utilities when factored 
against the sample data indicated that 
marginal savings could be realized 
by seeking records first from RLIN 
when comparable records might also 
have been obtained from OCLC. This 
difference was much less decisive 
than that which determined that LC 
should be the first source searched; 
but as the study's practical goal was to 
guide UML library staff in selecting a 
default order for searching the three 
sources, the task group recommended 
the order indicated. Cost efficiency 
was greatest when the source with the 
highest per-search charge, OCLC, was 
used the least. 



Conclusion and Suggestions 
for Further Study 

Based on the study's findings, UML 
implemented a policy of searching for 
copy first in LC, then in RLIN, and 
last in OCLC. This searching strategy 
enabled the libraries to significantly 
reduce their yearly costs for cata- 
loging searches in OCLC and RLIN 
in the fiscal year 2003-2004, where 
estimates prior to the study had been 
for increased costs. The libraries have 
continued to adhere to tire searching 
strategy, and in the fiscal year 2004- 
2005 were able to maintain lower 
levels of utilities costs for technical 
services while significantly increasing 
overall cataloging production (56,918 



titles cataloged in 2003-2004, com- 
pared to 73,529 in 2004-2005). 

UML upgraded to Aleph version 
16 in early 2005. A major change in 
this version of the software was the 
elimination of the separate search- 
ing module. Full search functional- 
ity, including Z39.50 searching and 
importing of records, is now avail- 
able in all the modules, including 
Acquisitions and Cataloging. The basic 
procedures remain the same, and this 
change has had no noticeable impact 
on the efficiency of searching and 
importing records. 

The calculations required for 
this study are complex and based on 
a number of estimates and limited 
data samples. One would hesitate to 
claim that the numbers the task group 
reports are accurate cost measures 
in every detail; nevertheless, the task 
group is satisfied that the estimates are 
sound approximations of highly elusive 
figures, and that their practical value 
as comparative measures for guiding 
library decision-making will stand up 
to scrutiny. That scrutiny is strongly 
encouraged for any library intending 
to make use of the group's findings. 
Libraries should also consider factors 
that may not have been addressed in 
the UML study. For example, if maxi- 
mizing the occurrence of other system 
numbers — for example, the OCLC 
system number — is important for 
other processes the library is engaged 
in, that would need to be factored into 
the equation. 

The amount of data collected on 
hit rates for particular types of materi- 
als was not large enough to draw any 
conclusions regarding adjustments to 
the preferred search order for those 
materials. Since the initial study, addi- 
tional data on hit rates for serials has 
been collected, and the same could 
be done for other formats, languages, 
subject areas, or older materials. The 
task group recognizes that this would 
be a useful area for further study. 

Other libraries seeking to deter- 
mine the most cost-effective and effi- 



cient sources for copy may wish to 
replicate this study for their individ- 
ual environments, noting the cautions 
identified in the previous paragraphs. 
The authors encourage others to 
explore similar projects and to report 
on their findings. 
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Appendix: Definitions of Level of Copy Found by Bibliographic Searchers 



Copy Level 

No copy 

Copy needs work 
Full copy 



Full copy 
(student) 



Definition 

No matching record is found. Even if a similar or related edition is found, the piece you are search- 
ing for differs bibliographically and a new record would need to be created in the local catalog. 

A minimal record is available, containing at least a matching 245, 260, and 300 field, but for the 
collection for which you are cataloging, the record would need additional authority, classification, 
and/or subject work. 

A record is available that would need no additional cataloging work before the item is placed on the 
shelf, as defined by the criteria of the specific collection. 

For example, periodicals are classed in some collections at UML and not others. If an other- 
wise full record were found that lacked a call number, it would be considered "Full copy" for an 
unclassed periodical collection, but "Copy needs work" for a classed collection. 

Simple definition for student workers: Full copy has matching 245, 260, and 300 fields, at least one 
subject heading (6xx field), and an LC call number (050 field). 
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Notes on Operations 

If You Buy It, Will They Use It? 

A Case Study on the Use of 
Classification Web 

Anna M. Ferris 

This paper presents a study conducted at the University of Colorado at Boulder 
(CU-Boulder) to assess the extent to which its catalogers were using Classification 
Web (Class Web), the subscription-based, online cataloging documentation 
resource provided by the Library of Congress. In addition, this paper will explore 
assumptions made by management regarding CU-Boulder catalogers' use of the 
product, possible reasons for the lower-than- expected use, and recommendations 
for promoting a more efficient and cost-effective use of Class Web at other institu- 
tions similar to CU-Boulder. 
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Catalogers at the University 
Libraries of the University of 
Colorado at Boulder (CU-Boulder) 
began using Classification Web (Class 
Web) in June 2002, shortly after it 
was introduced by the Cataloging 
Distribution Service (CDS) of the 
Library of Congress (LC). 1 At that 
time, Class Web was publicized as 
the first LC resource to offer cata- 
loging documentation via the Web. 
Today, it is used by working catalog- 
ers throughout the world to formu- 
late classification numbers and subject 
headings according to the standards 
and rules published in the Library of 
Congress Classification (LCC) and the 
Library of Congress Subject Headings 
(LCSH). 2 

At CU-Boulder, catalogers have 
come to depend on Class Web for 
various reasons. The most important 
is the convenience it offers catalogers, 
who can now access both LCC and 
LCSH from their personal worksta- 
tions. Other noteworthy advantages to 
using the tool include: 

• a correlation feature that links 
both resources; 

• automatic calculation of clas- 
sification numbers; 

• a correlation (added July 1, 
2004) between Dewey Decimal 



Classification numbers and 
their corresponding LCC num- 
bers and LCSH entries; 

• files that are updated by LC on 
a weekly basis; 

• links to other libraries' catalogs 
(such as LC's Online Public 
Access Catalog), including the 
ability to customize a link to the 
user's own catalog; and 

• software features that allow an 
individual user to limit searches 
to specific portions of files that 
are most pertinent to his or her 
needs. 

The first desktop resources for 
catalogers offered by CDS, Catalogers 
Desktop (Desktop) and Classification 
Plus (Class Plus), were introduced 
at CU-Boulder in 2001, although 
they entered the market in 1994 and 
1996 respectively. In April 2002, CDS 
announced that it would be discontin- 
uing Class Plus in favor of Class Web, 
a new Web interface for accessing 
the same LC classification and sub- 
ject headings data. Based on product 
descriptions at the time, management 
at CU-Boulder assumed that the 26 
catalogers throughout the University 
Libraries would be able to make a 
smooth transition from Class Plus to 
Class Web. Accordingly, the heads of 
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CU-Boulder's cataloging, acquisition, 
and systems departments decided to 
switch die Class Plus subscription to 
Class Web and to keep CU-Boulder's 
user agreement at 24 concurrent users 
at a total cost of $1,500 annually. 

Because Class Web was viewed as 
simply an enhanced version of Class 
Plus, and many catalogers were already 
knowledgeable about how to use Class 
Plus, department heads did not think 
that offering specialized training for 
Class Web was important. Catalogers 
were simply referred to die tutorial 
built into Class Web to help them 
make the transition between the two 
products. The decisions made during 
the switchover to Class Web regard- 
ing the number of concurrent users, 
the type of transition, and training 
were based on assumptions that were 
thought to be valid for the specific 
needs of the cataloging department at 
that particular moment. The validity of 
these assumptions, a users' survey, and 
a statistical analysis of Class Web usage 
are examined in this paper. 

Literature Review 

The cataloging literature includes 
several papers that address the 
importance of providing catalogers 
with access to cataloging resources in 
electronic format. Hine investigated 
the impact that early automation had 
on increasing productivity and qual- 
ity of work and stressed the need for 
further research on the subject of 
automated workstations for profes- 
sional catalogers. 3 Brisson took up 
this charge, and traced the historical 
development and foundations of the 
catalogers personal workstation, cit- 
ing Desktop as "only the beginning" 
of the types of tools being made avail- 
able for catalogers. 4 The impact of 
automation itself on routine catalog- 
ing operations and on working cata- 
logers, in particular, was the subject 
of papers dealing with the process of 
change from a perspective of human 



reactions (Fiste and Thornton) and 
behavioral psychology (Cooper). 5 

Other papers refer specifically to 
the LC cataloging tools. Basic reviews 
have been written about Desktop 
(Leazer) and Class Web (Creamer, 
Selden). 6 Johnson evaluated online 
resources available to serials catalog- 
ers and briefly touched on the appli- 
cability of Desktop for their purposes. 7 
Leroy and Thomas provided a similar 
review of Web-based resources for 
catalogers in general, and included 
short summaries on Desktop and 
Class Web. 8 Simpson and Williams 
went beyond the cursory overviews 
and provided an in-depth analysis of 
the impact that Desktop had on cata- 
loging operations at 159 academic and 
public libraries. 9 

No comparable in-depth research 
has been produced to date on the 
subject of Class Web. This paper seeks 
partly to rectify this situation by pro- 
viding a detailed assessment of how 
Class Web is being used by catalogers 
at CU-Boulder. 



Exploring Usage of Class 
Web at CU-Boulder 

Although no formal evaluations were 
conducted, a year and a half after 
switching to Class Web, catalogers at 
CU-Boulder appeared satisfied with 
the convenience provided by this new, 
Web-based product. As Class Web 
had apparently become a valuable 
resource for CU-Boulder's catalog- 
ers, and because they appeared to 
have adapted to it with little difficulty, 
the author speculated that the same 
product could also prove to be a use- 
ful tool for noncatalogers. Would it 
be helpful, for example, for reference 
personnel to have direct access to 
LCSH at the reference desk? Before 
developing a program to introduce 
Class Web to the more than 20 mem- 
bers of the reference department, the 
author decided to test her assump- 
tions about CU-Boulder's existing use 



of Class Web by gathering data. These 
statistics would establish a baseline 
level of use by CU-Boulder catalogers 
and could then be compared to later 
usage figures once reference person- 
nel began using Class Web for their 
own purposes. CDS expressed inter- 
est in the study and agreed to supply 
summary data reports. 

Upon request (on a case-by-case 
basis), CDS is able to produce a sum- 
mary data report that provides: 

• date; 

• number of maximum, concur- 
rent users logged on to Class 
Web for drat date at a particular 
time; 

• count of how many records 
were viewed each time the 
next or previous buttons were 
pressed during a twenty-four- 
hour period per individual cata- 
loged and 

• count of the total number of 
MABC records displayed per 
user. 

The author requested reports 
for four consecutive months, from 
November 2003 through February 
2004, which recorded the number of 
maximum concurrent users logged on 
to Class Web during that time. 

The summary reports, received 
from CDS on February 23, 2004, 
showed a lower-than-expected use of 
Class Web — out of a total of 26 cata- 
logers who had access to Class Web, 
only 3 or 4 were logged on at a time. 
Figures 1-4 illustrate Class Web use 
for each of the four months, showing 
the maximum number of simultane- 
ous users that were reported for each 
day. Only days for which there was use 
are shown. Average use per day was 
also calculated. 

To gain an understanding of the 
meaning of these baseline statistics, 
each cataloger was surveyed regarding 
Class Web use. The survey, appen- 
dix A, was sent to each cataloger 
by e-mail on April 12, 2004. (The 
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Figure 1. November 2003 Class Web use (14 days) 
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Figure 2. December 2003 Class Web use (22 days) 



Table 1. General usage 



Original 

Copy catalogers catalogers Total 



Respondents 


% 


No. 


% 


No. 


% 


No. 


Use Class Web 


39 


9 


35 


8 


74 


17 


Do not use Class Web 


26 


6 








26 


6 


Use Class Plus 


43 


10 


35 


8 


78 


18 


Do not use Class Plus 


22 


5 








22 


5 


Total respondents 


65 


15 


35 


8 


100 


23 



survey also included questions about 
how catalogers use of Class Web and 
other resources to assign classification 
numbers and subject headings; these 
data are not reported in this paper 
and the questions are not present in 
appendix A.) 

Survey Results 

Of the 26 surveys that were sent, 
23 (88 percent) were returned. Table 
1 summarizes the findings regarding 
general Class Web and Class Plus use. 

The data reveal two significant 
points. First, while Class Web was 
available to all 26 catalogers, and 
management had presumed that 
Class Web was being used by them 
all, 6 copy catalogers (26 percent) 
did not use the product, and 5 of 
these copy catalogers (22 percent) 
had not used Class Plus before the 
switch. A distinction should be made 
between the terms use, non-use, and 
non-adoption. Use refers to the act 
of consciously choosing to consult 
Class Web in order to search for 
information pertaining to LCC or 
LCSH; non-use refers to the interval 
during which a cataloger who usually 
consults Class Web is not using the 
product; non-adoption refers to the 
conscious choice by a cataloger not 
to consult the product. The fact that 
one copy cataloger used Class Plus 
but chose not to adopt Class Web is 
of questionable significance because 
this cataloger reported extremely low 
use of Class Plus — less than once a 
week. (See frequency of use data in 
table 2.) 

Second, the distribution between 
copy and original catalogers shows 
that the percentage of Class Web 
users among the original catalog- 
ers was higher (100 percent) than 
among the copy catalogers (60 per- 
cent). This high percentage is almost 
certainly attributable to the fact that 
CU-Boulder's original catalogers are 
active participants in the Program 



for Cooperative Cataloging and are 
therefore necessarily adherents of the 
cataloging standards and guidelines 
promoted by LC. 



The frequency with which cata- 
logers logged on to both Class Web 
and Class Plus was also explored 
in the survey. Table 2 provides a 
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Figure 3. January 2004 Class Web use (21 days) 
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Figure 4. February 2004 Class Web use (23 days) 



breakdown of how often users logged 
on to each product. 

Possible Explanations for 
Low Class Web Usage 
at CU-Boulder 

While there are a number of rea- 
sons that could explain the lower- 
than-expected usage statistics at 
CU-Boulder, low use or non-adoption 
do not appear to be the result of any 
deficiencies or issues of quality con- 
nected to the LC product itself. Other 
factors that might contribute to lower- 
than-expected usage may include: 



• No formal training was con- 
ducted for Class Web as had 
been done for Class Plus. 

• Copy catalogers do not need to 
use Class Web as frequently as 
original catalogers because call 
numbers and subject headings 
are already provided in most 
LC and member-copy records. 

• Other methods of verifying call 
number and subject heading 
data are available. 

• Catalogers may be experiencing 
computer access problems. 

• Original catalogers may be 
away from cataloging tasks 
frequently enough (at confer- 



ences, on research leave, and 
so on) that this may have an 
impact on the frequency with 
which they access Class Web. 

• The structure of the tool may 
lead catalogers to log on and off 
in reaction to a particular query 
(or set of queries) rather than to 
leave a session running. 

• Some catalogers may be typical 
late adopters; that is, they are 
comfortable with their current 
routines and do not immedi- 
ately appreciate the advantage 
of adopting a new technology. 
Other catalogers may be selec- 
tively technophobic — reluctant 
to shift over to an electronic 
resource when die paper ver- 
sion remains readily available. 
This seems a likely explanation 
for the non- adoption of either 
Class Plus or Class Web. 

CU-Boulder's case is likely not 
unique. In any library, the level of Class 
Web use may be influenced by some 
combination of die previously men- 
tioned factors. While many of these 
factors deserve further investigation, 
the author chose to concentrate on one 
in particular — lack of training — as that 
was a factor diat could be corrected. As 
a result, a formal training program was 
developed in the hope that it would 
increase the use of Class Web. 

Effects of Formal Training 
for Class Web 

To test whether the low use was attrib- 
utable to a lack of training, a hands-on 
introduction to Class Web was pre- 
sented on April 13, 2004, at a meeting 
of all catalogers. As a result of prepar- 
ing for and giving the demonstration, 
several things became clear. The two 
products (Class Web and Class Plus) 
were significantly different. Despite 
offering the same content as Class 
Plus, Class Web's browser interface 
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Table 2. Frequency of use 





Copy catalogers 


Original catalogers 


Frequency of use 


Class Web 


Class Plus 


Class Web 


Class Plus 


More than once a day 


6 


5 


3 


3 


Once a day 








1 





2-3 times a day 


1 





4 


5 


Less than once a week 


2 


5 
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y 
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(average # of users = 4.4) 
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8 - 

7 - 

6 - 




1 2 3 4 5 8 9 10 11 12 15 16 17 18 19 22 23 24 29 30 
(average # of users = 3.2) 



Figure 6. November 2004 Class Web use (20 days) 



and other enhancements made it a 
superior product. Although they were 
encouraged to do so, many catalogers 
did not make use of the tutorial or the 
trial account to familiarize themselves 
with the updated product. Catalogers 
appreciated having someone demon- 
strate the product to them. Logging on 
and off was not an issue — the product 
does log users off after thirty minutes 
of inactivity. 

In order to assess whether for- 
mal training had had a noticeable 
impact on use, the author requested 
two additional summary data reports 
from CDS for the months of April and 
November 2004. (Because November 
2003 was already available, at the time 
of this writing, November 2004 would 
be the first summary report to enable 
a comparison across years.) The 
charts corresponding to die April and 
November 2004 reports are shown in 
figures 5 and 6. 

Data Analysis 

Microsoft Excel 2003 was used for 
the statistical analysis of the sum- 
mary data reports. The descriptive 
statistics following are presented in 
the chronological order in which the 
reports were requested from CDS: 
(1) the initial batch of reports from 
November 2003 through February 
2004; (2) April 2004, for the before 
and after training figures; and (3) 
November 2004, the first report pro- 
viding annual totals. Appendix B pro- 
vides monthly lists of concurrent users 
taken from these reports. 

November 2003 
through February 2004 

Each data point corresponds to the 
number of maximum users logged 
on to Class Web during 14 days in 
November, 22 days in December, 
21 days in January, and 23 days in 
February (see figures 1—4). Shaded 
cells are used in appendix B to show 
weekends and holidays; weekends 



Figure 5. April 2004 Class Web use (22 days) 



account for half of die 12 days that 
only 1 user was logged on to the sys- 
tem at a time. The central tendency in 
the distribution of data points, and as 
seen in the four mean totals, is spread 
between 3 and 4 users per day. 



April 2004: Before and after Training 

The next set of figures, taken from 
the summary data report for April 
2004 (see figure 5), includes the 
training date (marked with an aster- 
isk in appendix B). Background use 
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remained between 3 and 5 users, but 
significantly higher use was reported 
on the days immediately preceding 
and following April 13, the day train- 
ing occurred. Fifteen users logged on 
to the system the day before training, 
and 9 users logged on the day after 
training. The high use on these two 
days can be ascribed to the fact that 
catalogers were taking a closer look at 
the product. Despite these two days of 
increased use, the mean total of users 
remains at a low level (4.409), and the 
standard deviation is 2.823 from the 
mean. When the anomalous numbers 
(15 and 9) are removed from the cal- 
culations, the mean total for April goes 
down to 3.65, and the standard devia- 
tion drops to 1.089 — closer to the pat- 
tern of usage seen in the November 
2003 through February 2004 totals. 

November 2004 

As mentioned earlier, the first set 
of comparative across-year statistics 
received from CDS were for the 
month of November 2004 (see figure 
6). These statistics show a decrease 
in the average number of users (3.2) 
in 2004 as compared to the average 
number of users (3.4) during the same 
month die previous year. Even with six 
more working days in 2004, the aver- 
age number of users is still lower than 
in 2003. Given the unforeseen results 
of this comparison between the 2003 
and 2004 mean totals of Class Web 
users, a t-test analysis was conducted 
to test the significance of the differ- 
ence between the two totals. 

T-Test Results 

The main reason for performing a 
t-test (in this case, a paired t-test, 
as the same subjects were involved 
in both monthly reports) was to test 
the likelihood of whether die author's 
hypothesis, relating low Class Web 
usage to a lack of training, was a valid 
one. The supposition was that once 
catalogers were offered formal train- 



ing, use of the product (and conse- 
quently the statistics) would increase. 

The null hypodiesis (difference 
is equal to 0) presumed little or no 
change between the levels of use 
before and after training. The alterna- 
tive hypothesis (difference is not equal 
to 0) presumed a significant change 
in the level of use after training. The 
determination of a statistically sig- 
nificant difference between die two 
means is reported as a p-value. Using 
the mean totals of 3.4 (November 
2003) and 3.2 (November 2004), the 
calculation resulted in a p-value of 
.709672. Typically, if the p-value is 
greater than 0.05, the conclusion is 
that no significant difference exists. 
Inasmuch as .709672 is greater than 
0.05, the null hypothesis must be 
accepted — no statistically significant 
difference exists between the mean 
totals for November 2003 and 2004. In 
other words, there is insufficient evi- 
dence to conclude that formal training 
helped increase the use of Class Web, 
given that the pattern of use after 
training was comparatively unchanged 
from what the pattern had been prior 
to the training. 

Discussion 

Some important issues were uncov- 
ered by exploring Class Web use 
through an in-depth analysis of sum- 
mary data reports and a usage survey 
for catalogers. First, Class Web use 
(an average of 3 to 5 maximum users at 
any one time) at CU-Boulder was con- 
sistently lower dian expected. This was 
perceived as low because access to the 
resource had been set to accommo- 
date as many as 24 concurrent users. 
Second, formal training did not have 
a significant effect on whether or not 
catalogers chose to use the product. 

Two additional issues were high- 
lighted by the usage survey. The first is 
that managers should not assume that 
every cataloger will need or choose 
to use Class Web. At CU-Boulder, 



26 percent of the catalogers surveyed 
(6 out of 23) did not use the prod- 
uct. Additionally, the extent to which 
catalogers use Class Web may depend 
upon the job responsibilities held by 
those catalogers; that is, original cata- 
logers may need to use Class Web 
more extensively than copy catalogers. 
Anecdotal evidence suggests that senior 
catalogers at CU-Boulder made more 
use of paper documentation in the 
past than they make use of electronic 
resources today, but this difference 
appears to be because their current 
job responsibilities require them to 
focus more on managerial duties and 
less on cataloging or authority work 
functions than previously had been 
the case. Many observed that catalog- 
ers would use the electronic resources 
as much as they had used the paper 
cataloging tools (and perhaps more), 
if their job responsibilities returned to 
what they had been in the past. 

While the results of this study 
were not what management at CU- 
Boulder had expected to find in 
regards to user numbers and the tran- 
sition from one product to the other, 
its significance may well be appreci- 
ated by other institutions that are cur- 
rently subscribed — or are planning to 
subscribe — to Class Web. The author 
hopes that addressing these issues will 
lead to a more efficient and cost- 
effective use of this product. 

Conclusion 

The case study presented in this paper 
has implications for use of Class Web at 
CU-Boulder as well as at odier institu- 
tions. At CU-Boulder, die cataloging 
department will be reassessing its user 
agreement for 24 concurrent users. A 
more reasonable access level will need 
to be determined based on the data 
collected. A complicating factor will be 
whether Class Web is made available to 
noncataloging personnel. If that is the 
case, reducing access to 3 or 4 concur- 
rent users would not be wise. Formal 
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training, accompanied by adequate 
documentation, will continue to be pro- 
vided for new catalogers. Despite die 
formal training's lack of impact on the 
level of Class Web use, survey results 
showed that catalogers did appreciate 
die training, and drat it made a differ- 
ence in the effectiveness of catalogers' 
use of the tool. The benefits of using 
Class Web will be reinforced from time 
to time by reporting on special situa- 
tions or examples in which the product 
can be especially helpful. 

The original impetus for this 
study was to explore the possibility 
of extending the utility of Class Web 
beyond catalogers to noncataloging 
personnel. A study of current use 
among catalogers was considered a 
necessary first step in determining 
training approaches and estimating 
possible levels of use by public ser- 
vices personnel. While this study may 
suggest, based on the moderate use of 
Class Web made by catalogers, that 
expectations for adoption outside cat- 
aloging should be modest, it revealed 
no obstacles to providing appropri- 
ate training and introducing Class 
Web to other units within the library. 
Accordingly, a formal presentation is 
being developed for personnel out- 
side of technical services, such as 
reference staff and librarians, biblio- 
graphers, and library workers at 
branch libraries, in order to demon- 
strate the ways in which Class Web 
can be beneficial to them. A follow- 
up study of potential and actual use is 
also being designed. 

Broader implications exist for 
institutions similar to CU-Boulder. 
The following recommendations will 
address some of these issues. First- 
time subscribers should make sure 
that appropriate concurrent use is 
ordered. As this study shows, match- 
ing concurrent user access to the num- 
ber of potential Class Web users at an 
institution is not necessary. Even at 
LC, the ratio of Class Web users to 
concurrent user access is 4:1 (roughly 
644 potential users to 161 concur- 
rent users) according to Cheryl C. 



Cook, Class Web product manager at 
the time of this writing. 10 First-time 
subscribers should consider ordering 
minimal access at first, and then add- 
ing more access as the need arises. At 
institutions that are already subscribed 
to Class Web, management should be 
mindful of how the product is received 
and used by cataloging personnel — or, 
perhaps, not used. If possible, use 
should be monitored. This is easier 
to do with Web-based products from 
which statistics are readily extracted. 
Statistics are not as easily obtained 
from CD-ROM products unless spe- 
cific parameters are set by systems 
personnel beforehand. 

Areas meriting further inves- 
tigation were revealed through the 
course of this analysis. How do cata- 
logers use Class Web? What moti- 
vates them to turn to Class Web 
rather than to another resource? 
Despite prompting, why do catalog- 
ers tend not to make use of tutorials 
or trial accounts when they are avail- 
able? A follow-up survey of nonus- 
ers might help uncover the reasons 
behind non-adoption decisions and 
low usage in general. Lastly, a detailed 
analysis of Catalogers Desktop use 
would be a welcome addition to the 
literature, particularly as the product 
has recently undergone a similar shift 
from CD-ROM to Web format. 

This study was prompted by an 
interest in expanding the use of Class 
Web to other noncataloging units. 
Many tools developed exclusively for 
catalogers have come to be used out- 
side the cataloging department (for 
example, the LCSH printed editions). 
Some have been transformed for wider 
use (the OCLC database, once seen 
only by catalogers, is today known as 
WorldCat and searched directly by the 
public). When cataloging tools were in 
paper format and expensive, consult- 
ing them outside of technical services 
units presented a problem and limited 
their usefulness. Now that uhese cata- 
loging tools are being offered online, 
access to them is limited only by the 
availability of a computer, a licensing 



agreement, sufficient concurrent user 
access, or simply the awareness that 
such tools exist. 

Class Web is more than a tool 
specifically designed for and widely 
used by catalogers. It represents the 
innovations that are having an impact 
on library personnel everywhere. 
Federated searching, aggregator data- 
bases, and other digital initiatives, 
such as electronic resource manage- 
ment, are only a few such innovations. 
During this time of technological 
advancements, cataloging depart- 
ments should make the most efficient 
use of such tools as Class Web, not 
only by putting it to use within their 
own unit, but also by promoting its 
use to other noncataloging units, and 
thereby furthering the collaboration 
that already exists between techni- 
cal services personnel and their col- 
leagues in public services. 
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Appendix A. Classification Web Survey for Catalogers 

Please check one: I am a copy cataloger I am an original cataloger 

1. How often do you use Classification Web (Class Web)? 

More than once a day 2-3 times a week Less than once a week Once a day 

I do not use Classification Web 

2. How often did you use Classification Plus (via Cataloger's Desktop) before the transfer to Class Web? 
More than once a day 2-3 times a week Less than once a week Once a day 

I do not use Classification Plus 

Please forward the completed survey to Anna Ferris. Thank you. 
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Appendix B. Daily Usage Data from Monthly Summary Reports 



Date 


Nov. 03 


Dec. 03 


Jan. 04 


Feb. 04 


Apr. 04 


Nov. 04 


1 




3 








8 


2 




3 


1 


4 




5 


3 




4 




5 




3 


4 




5 




4 




3 


5 




2 


4 


2 


3 


4 


6 






5 


2 


4 




7 


4 




3 




4 




8 




5 


4 


1 


4 


4 


9 




5 


5 


3 


5 


3 


10 


3 


3 




3 




3 


11 


3 


3 




4 


1 


2 


12 


3 


3 


3 


3 


15 


1 


13 


4 


1 


3 


3 


3* 




14 


2 




3 




9 




15 




4 


4 




3 


1 


16 




4 


3 


3 


4 


3 


17 


2 


3 




6 




3 


18 


3 


2 




3 


1 


4 


19 


4 


3 




4 


5 


4 


20 


4 




3 


4 


5 




21 


3 




3 


1 


4 




22 




2 


2 


1 


4 


1 


23 




3 


4 


7 


4 


4 


24 


4 


1 




5 




2 


25 


3 




1 


5 






26 


5 




3 


4 


3 




27 






6 


5 


4 




28 






4 




4 




29 




2 


3 




4 


3 


30 




4 


4 




4 


3 


31 




2 










Mean 


3.357 


3.045 


3.381 


3.565 


4.409 


3.2 


StDv 


.841 


1.174 


1.203 


1.561 


2.823 


1.576 



Note: Shaded boxes indicate weekends and holidays; 'indicates the training day. 
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Book Reviews 



Edward Swanson 

The Complete Guide to Acquisitions Management. By 

Frances C. Wilkinson and Linda K. Lewis. Westport, Conn.: 
Libraries Unlimited, 2003. 312p. $45 softbound (ISBN 1- 
56308-892-4); $60 hardbound (ISBN 1-56308-890-9). 

Boss Atkinson writes, "The place and image of the 
library in the institution is mirrored in the position and 
perception of the acquisitions operation in the academic 
library." 1 Acquiring and managing access to information 
resources remains a core function within all types of librar- 
ies. However, the library acquisition function has changed 
during the past decade. The exponential growth of elec- 
tronic resources combined with budgetary constraints and 
rising information costs have resulted in many challenges 
for libraries. It is important that librarians and support staff 
responsible for library acquisitions have the skills and knowl- 
edge necessary to successfully manage the acquisition and 
delivery of information resources. 

The Complete Guide to Acquisitions Management was 
written to fill a gap in library education. The authors con- 
ducted a survey of American Library Association-accredited 
graduate library programs in 1999 and summarized the out- 
comes in the introduction. Besults of the survey (64 percent 
response rate) indicated that only 13.9 percent of graduate 
library programs teach a separate course on acquisitions 
management, but 93.5 percent do include it as a component 
in other courses. The majority of respondents (79.3 percent) 
supported the publication of a new textbook on acquisitions 
management. The authors' goal was to publish an up-to-date 
resource to support training of staff new to library acquisi- 
tions and "provide the reader with both procedural and 
philosophical approaches to acquisitions"(xi). 

Authors Wilkinson and Lewis are well qualified to 
write a book on acquisitions management based on their 
own relevant job experience at the University of New 
Mexico and numerous publications on library acquisition 
related topics. The authors effectively synthesize infor- 
mation gathered from numerous resources into a well- 
organized volume that succinctly covers a broad range 
of topics. However, this book does not provide the same 
level of detail and discussion that appeared in the highly 
regarded second edition of Understanding the Business 
of Library Acquisitions, edited by Karen Schmidt. 2 What 
the book does offer is a good introduction to acquisitions 
management, educating the reader on core acquisition 
functions. Beferences and links throughout the book lead 
the reader to resources that will most often provide more 
in-depth discussion of topics. 



The book is organized into twelve chapters that address 
the following topics: organization of acquisition depart- 
ments; acquisitions systems; publishing industry; vendor 
selection and evaluation; acquiring books and media; 
serials; electronic resources; out-of-print and antiquarian 
materials; gifts and exchange programs; bindery opera- 
tions; outsourcing acquisitions; and professional ethics. An 
appendix follows that includes useful references and links 
to automated acquisitions systems, relevant conferences, 
publisher and vendor sites, electronic discussion lists and 
Web sites, acquisitions-related journals, organizations, and 
reference tools. A glossary of more than two hundred terms 
(some with links to additional resources) is included. An 
index to the book's content completes the volume. 

Chapters 4 through 9 offer the most immediate value to 
staff new to library acquisitions. Chapter 4 provides excel- 
lent discussions on vendor types, vendor selection criteria, 
the basics of the request for proposal (BFP) process, and 
vendor evaluation criteria; the BFP discussion is espe- 
cially useful for anyone not experienced with that process. 
Chapters 5 through 9 address the specifics of acquiring dif- 
ferent formats (such as books) and the management of gifts 
and exchange programs. Of particular interest is chapter 7, 
"Acquiring Electronic Besources." During the past decade, 
electronic resources have increased in both number and 
acceptance by library users. A particular challenge for librar- 
ies has been how to effectively integrate management of 
electronic resources into existing organizational structures. 
Chapter 7 provides the reader with a good introductory 
treatment of electronic resource management. New devel- 
opments in the area of electronic resource management 
since publication of this book, such as the Digital Library 
Federation Electronic Besource Management Initiative, 
will require the reader to supplement their knowledge with 
more current information. However, if the reader refers 
to die appendix or glossary, there are references and links 
to useful resources (for example, the entry under Digital 
Library Federation in the glossary includes a link to the 
federation's Web site). 

Chapter 11, "Outsourcing Acquisitions," provides a bal- 
anced discussion of when to outsource, what to outsource, 
and the advantages and disadvantages of outsourcing. As the 
authors point out, "Outsourcing should be a management 
tool, not a goal" (216). Chapter 12, "Professional Ethics," 
provides a good introduction to the issues, especially eth- 
ics in acquisitions. This is a topic that is often inadequately 
addressed in similar texts. The appendix and glossary add 
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value to the book by providing the reader with access to a 
wide range of useful resources. 

The Complete Guide to Acquisitions Management does 
achieve die authors' goal of providing an up-to-date, com- 
prehensive guide to acquisitions management for staff new 
to the field. Published in 2003, the content remains timely. 
As previously noted, the book does not provide the detailed 
level of discussion that more experienced acquisitions 
and electronic resource librarians require that appeared 
in Understanding the Business of Library Acquisitions. 
However, this book does provide a good introduction to 
acquisitions management and is recommended for gradu- 
ate library school collections and library technical service 
or acquisitions department reference collections. — Robert 
Alan (roal@psulias.psu.edu), Pennsylvania State University, 
University Fork 
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Managing Information Technology: A Handbook 
for Systems Librarians. By Patricia Ingersoll and John 
Culshaw. Westport, Conn.: Libraries Unlimited, 2004. 199p. 
$45 hardbound (ISBN 0-313-32476-X). 

This handbook is a much-needed text regarding sys- 
tems librarianship and covers topics not encompassed by 
or in need of updating from earlier works by Thomas C. 
Wilson and Bachel Singer Gordon. 1 Many systems librar- 
ians today are still self-taught, despite the widening of 
library school curricula to include more information tech- 
nology basics, and this handbook is a must for the desks of 
those librarians. Although the Wilson text contains a more 
thorough (though rapidly becoming dated) treatment of 
the history of computers in libraries and the philosophy of 
systems librarianship, Ingersoll and Culshaw provide more 
than adequate introductory material and address a wide 
assortment of subject areas relating to information tech- 
nology. Including many practical tools and lots of tips and 
advice, their work amounts to a compilation of the many 
and sundry things they have learned and developed dur- 
ing their collective thirty years of on-the-job learning in an 
academic library setting. 

The first chapter of the work is devoted to planning. 
With technology playing the major role it does in today's 
libraries, the importance of the systems librarian's involve- 
ment from the outset in any planning process is stressed. 
The authors appropriately emphasize that successful results 
are particularly dependent on systems staff involvement 
in any planning related to integrated library system (ILS) 



migration. With the flux in the technology marketplace and 
the accompanying demise of so many ILS vendors and sys- 
tems, system migration has become a high priority for many 
systems departments, and those not affected now can expect 
to have to deal with the problem somewhere down the road. 
Ingersoll and Culshaw therefore are right to consider the 
planning for migration as a subject to be handled in some 
detail, as they do. 

The authors emphasize that reporting lines and staffing 
should respond in a dynamic way to environmental changes. 
They note that close alliances between the library and the 
computer center are likely on today's college and university 
campuses; if a formal reporting connection does not exist 
between the library and the computer center, one can gen- 
erally expect that there is a strong informal relationship, and 
they stress communication with the intent of shifting "the 
emphasis from control of information to sharing it" (39). 
Good advice. 

A systems librarian must be an excellent communica- 
tor within his or her own organization and also in inter- 
acting with peers outside their library as well as with oth- 
ers who may not be so knowledgeable about the ins and 
outs of technology. Ingersoll and Culshaw stress that the 
use of technical jargon and acronyms can often inhibit 
communication with those outside the systems depart- 
ment, in much the same way as the use of library jargon 
and acronyms can sometimes inhibit communication 
between librarians and nonlibrarians. 

The wide variety of topics covered in the work is both 
a strength and a weakness. On the plus side, this work 
will surely open the eyes of some students and even some 
systems librarians to the ever-expanding role that they are 
being asked to play. On the odier hand, some of the impor- 
tant topics covered, such as statistical reporting, link check- 
ing, wireless computing, and other service-related topics, 
are covered in a summary style, taking up only a few pages. 
The reader will often wish uSat these and a number of other 
topics were covered in more detail. 

For a book published in 2004, the dates of the literature 
cited seem a bit problematic. The authors quote abundantly 
from literature published in the mid-1990s or even earlier. 
Given the subject nature of the work and the fluid nature of 
the systems field, the reader is often left wondering about 
the relative absence of systems librarianship literature from 
2000 and beyond. AMiough such literature is not plentiful, it 
is always difficult for the skeptical reader to give credence to 
the continuing validity of materials more than ten year's old 
in this rapidly changing field. The authors might have found 
it fruitful and useful to their readers to do a better job of 
explaining the relevance of the materials cited, even if only 
as an attempt to provide a historical perspective. 

At 132 pages, with an additional 49 pages devoted to 
resource materials containing a sample local area network 
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(LAN) account policy, a sample acceptable-use policy, and a 
library technology plan, the handbook will serve its readers 
well as a way to get their arms around a wide-ranging sub- 
ject. Other resource material provided includes ILS vendor 
information, a job evaluation form, a sample of an online 
help page, a professional reading list, and a good bit more. 
These resource materials will no doubt be helpful to systems 
librarians at many levels and in many situations. 

Although library and information science education 
schools should now be supplying future librarians with sys- 
tems librarianship specializations, most practicing systems 
librarians today got into their roles mostly because they 
are the type of people who feel comfortable with and are 
undaunted by computers, hardware, and software. Especially 
for this virtually self-taught type of systems librarian, the 
Ingersoll and Culshaw text should be a most welcome addi- 
tion to their collection. — Vicki L. Gregory (Gregory@shell. 
cas.usf.edu), University of South Florida, Tampa 
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Binding and Care of Printed Music. Music Library 
Association Basic Manual Series, no. 2. By Alice Carli. 
Lanham, Md.: Scarecrow, 2003. 179p. $52.95 cloth (ISBN 
0-8108-4651-9); $29.95 paper (ISBN 0-8108-4652-7). 

This straightforward manual on binding music scores 
and maintaining music collections is the second volume in 
the Music Library Association's Basic Manual series. It is 
aimed principally at "music librarians with little conserva- 
tion background and to library conservators with little music 
background" (vii). The author notes that there are many 
other resources that cover the binding and conservation of 
printed books, and that this manual pertains specifically to 
the use of those techniques for binding, conserving, and 
preserving printed music. What sets it apart is the author's 
knowledge of how printed music is used, because such 
knowledge is essential to ensuring that the music will be 
useable after it is bound. Music librarians and music collec- 
tion managers are well-aware that printed music bindings 
must meet study and performance requirements — scores 
should lie flat on a music stand, and parts should not be 
jammed into pockets that are too tight or improperly con- 
structed. Thus, the author addresses the particular stresses 
to which printed music is liable that necessitate bindings 
that allow for openability, secure page attachment, and stor- 
age of loose parts. 

There are several options for music binding, as there 
are for binding books and journals. Carli describes each type 
of binding and comments upon its suitability for reading 



or performance. She then gives clear, detailed, and precise 
instructions for making each kind of binding. There is sound 
advice throughout for those who make overarching decisions 
about repairing and replacing music materials. The instruc- 
tions for all aspects of the binding process are succinct, free 
of jargon, and easy to follow. There is an extensive glossary 
covering terms that are specific to binding, conservation, 
and preservation. 

The verbal instructions are helped immensely by the 
accompanying simple line drawings, making it fairly easy to 
carry out the step-by-step instructions for the binding and 
conservation tasks described. Additionally, even if one does 
not carry out the instructions oneself, their usefulness lies in 
the detailed explanations of how bindings are constructed, 
when one type of binding is to be preferred over another, 
and what is to be avoided all together. It is a valuable manual 
for training bindery workers, and would also be a worth- 
while textbook in a music librarianship class. People who are 
working with commercial binders will also find these expla- 
nations helpful as they write their binding instructions. 

The chapter devoted to working with commercial 
binders is a compendium of advice on workflow, shipping 
preparation, and record keeping. The explicit instructions, 
such as those for preparing a shipment for the bindery and 
receiving a shipment back into the collection, are invalu- 
able to anyone who is new to the job and does not even 
know what questions to ask. They are also of value for 
reevaluating existing procedures. The author suggests that 
most of the work should be done by students under the 
supervision of a staff member who is well-versed in bindery 
procedures. Of course, the amount of preparation done at 
any library depends upon the level of staffing. One sugges- 
tion that the author does not mention, but that does make 
a real difference in one's approach to working with a binder, 
is to tour the bindery and talk to the people there about 
their procedures. 

Carli's view that there is a decline in the quality found 
among commercial binders, due to low pay and repetitive 
work, seemed a tad harsh, considering the pride that most 
binders take in the quality of their work and their com- 
pliance with die Library Binding Institute Standard for 
Library Binding. 1 However, her point that it is the respon- 
sibility of the library to correctly prepare the material and 
provide clear and complete instructions is well-taken. 

Most library managers find it difficult to carry out 
conservation and preservation plans for their existing col- 
lections in the face of the need to acquire and process new 
materials. Scores published before 1990, other than monu- 
mental pieces, tended to be printed on low-quality paper 
that has become brittle or disintegrated over time. Finding 
and repairing or replacing brittle or damaged scores is an 
undertaking that is beyond most libraries. Typically, librar- 
ies identify the damaged scores when they are returned to 
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circulation, which means that only those that are part of the 
standard repertory or are popular are treated. Carli suggests 
that a more proactive method of defining a group of materi- 
als to be tested for repair or replacement can be the basis 
for a grant proposal. Her discussion of the deacidification 
process is enlightening. It is clear that this is a process that is 
unlikely to be undertaken in any but the largest facilities, but 
any librarian could use what Carli has written as the basis for 
an informed grant proposal or a request for proposals from 
a conservation firm. 

Part of the process of determining how a piece will be 
treated is the development of a preservation policy based 
on the library's mission, with acknowledgment of local 
budget restrictions. The policy should be based on the col- 
lection manager's knowledge of the needs of the collection 
and the possibilities of finding replacement materials; the 
collection manager then brings the specialized knowledge 
of the various options for treatment. As in the sections on 
binding new materials, the chapter on repair and replace- 
ment is full of definitions and explanations of standard 
practices. For example, one might not think of microfilm- 
ing printed music, but this is the most reliable preservation 
method, and it is a format that can be printed or digitally 
scanned with equal success. 

The appendixes are welcome additions to the main 
text. The list of supplies and the drawing of tools and a 
properly laid out workstation are helpful for those not 
familiar with bookbinding. The sample conservation and 
preservation policies are useful for people who have little 
background in this area and are based on the author's 
expertise in the various techniques and options for preser- 
vation and conservation. 

This book gives useful advice on the practices and 
procedures of the care and handling of printed music to 
managers of printed music collections, whether or not they 
do their own binding and conservation. Most of it is aimed 
at institutions that do their own conservation and preserva- 
tion work, but it also offers sound advice for those that do 
not. The author's comments on the hazards of book repair 
include the danger of repetitive stress injuries and the need 
to have a work area that is designed with adequate ventila- 
tion to expel dangerous fumes. She even makes suggestions 
for varying the work to prevent workers from getting bored 
with the work and becoming sloppy. The detailed instruc- 
tions and illustrations in each chapter are instructive for 
music librarians and collection managers who make bind- 
ing decisions and have to communicate with commercial 
binders. The author's commonsense approach and thorough 
treatment of the topic make this a fine and unique refer- 
ence. However, the book would be greatly enriched by an 
index, its lack being the book's only real fault. — Christina 
Bellinger (christina.bellinger@unh.edu), University of Neiv 
Hampshire Library, Durham 
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Cataloger's Judgment: Music Cataloging Questions 
and Answers from the Music OCLC Users Group 
Newsletter. By Jay Weitz; arranged and ed. by Matthew 
Sheehy; foreword by H. Stephen Wright. Westport, Conn.: 
Libraries Unlimited, 2004. 265p. $46.95 cloth (ISBN 1- 
59158-052-8). 

Since May 1989, Jay Weitz, a consulting database 
specialist at the OCLC Online Computer Library Center 
(OCLC) who serves as the OCLC liaison to the Music 
OCLC Users Group (MOUG), and is a former music cata- 
loger himself, has been responsible for the question-and- 
answer (Q&A) column in the MOUG Newsletter. At the 
instigation of H. Stephen Wright, former MOUG chair, and 
Martin Dillon, Libraries Unlimited's acquisitions director, 
the complete text of all questions and all answers, from the 
first appearance of the Q&A column in May 1989 to the 
issue number 81, dated September 2002, of the MOUG 
Newsletter, was collected, indexed, and edited by Matthew 
Sheehy, and published in this monograph. 

In the preface to her Bibliographic Relationships in 
Music Catalogs, Sherry Vellucci reports that music catalog- 
ers often hear- the phrase "If you can catalog music, you can 
catalog anything!" "uttered with great respect by librarians 
whose primary responsibility is monographic print catalog- 
ing," and it is true that music cataloging is a tricky activity, 
requiring a vast array of skills. 1 Despite "Weitz's First Law: 
Don't agonize" (45) it sometimes proves to be of utmost 
difficulty not to agonize. In that regard, the Q&A column 
in the MOUG Newsletter is certainly an extremely helpful 
tool. The information provided there is always clear, accu- 
rate, and written in a pleasant way — as H. Stephen Wright 
says in his foreword, Jay Weitz's answers are "not in the 
stilted, officious voice of AACB2 or Library of Congress rule 
interpretations, but in a friendly, casual tone that suggested 
that he was one of us" (xii). Some of the catalogers who 
sent him their questions took that opportunity to confess 
to him that they cut all Q&A columns out of die MOUG 
Newsletter and keep them on their desk as a collection of 
particularly precious clippings. Such a testimony tends to 
show that this publication is more than welcome and meets 
an actual need. 

Having said that, I do have a few reservations. To be 
fair, I have to admit that both the author and the editor 
are, themselves, aware of most of them and responded to 
them in advance in their introduction. The choice of com- 
prehensiveness — almost none of the Q&As, even among 
the oldest ones, was discarded, "except for a few stray 
ones that were hopelessly misinformed" (xviii) — results in 
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"countless references to outmoded technology; superseded 
documentation and page numbers; old forms of headings; 
dead URLs; bibliographic and authority records that have 
long since changed; ancient rules; and obsolete practices" 
(xviii-xix). The categories under which Q&As are filed do 
not always seem perfectly satisfactory, but admittedly "there 
was no intuitively obvious logical organizing principle and 
... no matter what choices [the author and the editor] 
made, there was no pleasing everyone" (xviii). Some of the 
Q&As do not really address cataloging issues in the generic 
sense of die term, but rather are down-to-earth advice about 
how to use the OCLC system that strives to remain as ortho- 
dox as possible while coping with the technical limitations of 
that specific system (e.g., questions 1.1, 1.10, 3.19, etc.) — a 
concern quite understandable and legitimate in the context 
of the MOUG Newsletter, but that sounds a bit out of scope 
in the context of a self-contained monograph. Such Q&As 
are of little help for catalogers who catalog on a different 
system; perhaps these should have been dropped for the 
monograph publication. 

Wright reports in his foreword that MOUG considered 
for a while "placing all the columns on a Web site" (xiii). This 
would certainly have been a better mode of publication than 
the printed format, as it would have enabled the constant 
updating of the valuable information provided in these col- 
umns. One can hope that this book will soon be transferred 
to the Web, its appropriate place. 

Updating is a real issue for publications of this kind. Will 
most of the Q&As be still relevant, once the new cataloging 
code, Resource Description and Access (RDA), is in place? 
If Metadata Object Description Schema (MODS) or some 
other XML-based format happens to supersede MARC21 in 
the future, three-quarters of this book will become obsolete 
at once, for the real topic addressed in this book is not so 
much cataloging as it is the MARC21 format itself — and its 
inadequacy. This book can also be read as a kind of compen- 
dium of everything that is wrong with MARC21 and could 
have been titled How to Do One's Best with Inadequate 
Tools. For example, many pages (128-145) are dedicated to 
field 246, Jay Weitz's personal nightmare. The very existence 
of that field does not seem more justified after reading all 
those pages than before. The questions relating to field 041 
(pages 150-56), dedicated to language codes, show that this 
field was primarily designed for printed textual materials 
and does not suit quite well for sound recordings. Weitz 
admits, "Regarding the 041 subfield fb question, I have 
taken MARC21 at its word, though I'm not sure my inter- 
pretation is correct" (151). 

Similarly, even more than the "catalogers judgment," 
what this book tends to highlight is the inappropriateness 
of many current cataloging rules. Weitz has too much 
genuine respect for the rules to make the point, but many 
of the problems he had to solve in his Q&A column would 



vanish into thin air if the rules were more consistent. Of 
course, as Weitz puts it, "real-world instances, in spite of 
our never-ending efforts to codify practices, will always 
defy those efforts" (xix), but some ambiguities could be 
avoided. For example, there is a basic ambiguity about what 
it is we catalog at all when we catalog a sound recording: 
is it just the content infixed on the carrier (the recorded 
sound), the carrier itself (e.g., the CD), or the whole pack- 
age that the publisher intends as a product (the CD plus 
its container, liner notes, so-called accompanying material 
in our traditional but erroneous terminology, and so on)? 
This basic ambiguity results in a difficulty that many sound 
recording catalogers encounter: How to assign a correct 
publication date to a sound recording (pages 23-26 and 
elsewhere). There is a confusion between dates actually 
pertaining to the publication described, dates pertaining to 
bibliographic antecedents, and dates pertaining to the his- 
tory of the recorded sound itself. Hopefully, RDA will solve 
such problems. — Patrick he Boeuf (patrick.le-boeuf@bnf. 
fr), Bibliotheque nationale de France, Paris 
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Copyright in Cyberspace 2: Questions and Answers 
for Librarians. By Gretchen McCord Hoffman. New York: 
Neal-Schuman, 2005. $75 paper (ISBN 1-55570-517-0). 
Copyright for Teachers and Librarians. By Rebecca 
P. Butler. New York: Neal-Schuman, 2004. $59.95 paper 
(ISBN 1-55570-500-6). 

Books appearing more or less simultaneously on essen- 
tially the same subject often evoke from contemporary 
reviewers some reference to "The Blind Men and the 
Elephant" poem based upon the Indian legend about a 
number of sightless men who, upon encountering it for the 
first time, touch and feel different parts of die great beast 
and draw markedly different conclusions as to its nature. 1 
Both of these two books, by contrast, draw much the same 
conclusions, although they arrive at them by way of some- 
what differing approaches. Both are comprehensive works, 
covering much of the wide range of subjects pertaining to 
copyright law that are implicit in library operations, whether 
from the technical services or readers' services sides. 

Hoffman, described in the foreword to her book as that 
rarity — a librarian who went to law school rather than the 
perhaps more typical lawyer who went to library school — 
provides the more legalistically oriented of the two works. 
Copyright in Cyberspace 2 contains numerous citations to 
and lengthy excerpts from the applicable statutory law and 
court cases, and is an updated second edition of Hoffman's 
2001 work, which does a good job of bringing up to the pres- 
ent a number of the concepts and problems discussed in die 
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first edition. Part I improves and brings up to date the excel- 
lent short history of the copyright laws and basic applicable 
concepts contained in the first edition, and is especially use- 
ful in regard to the key fair use principle. 

Parts II and III undertake to apply the principles of 
copyright law to specific library situations and emphasize 
the potential copyright pitfalls librarians now typically 
face on almost a day-to-day basis. Effectively emphasizing 
that there do not always exist bright-line answers to every 
question, Hoffman rightly urges librarians to become more 
involved in developing the public policy issues at work in the 
copyright arena, especially when it come to matters involv- 
ing revising laws to reflect the digital and electronic realities 
presented in libraries today. 

Hoffman utilizes a sidebar Q&A supplementation to 
concurrent textual discussions in each chapter and to the 
general Q&A format used in each chapter (a sort of ques- 
tion-within-questions approach) in order to highlight impor- 
tant issues, an approach that is likely to be effective for the 
casual user treating the work more as a reference tome than 
a treatise on its subject. 

To work best as a reference book, however, a Q&A 
format demands a comprehensive index, and it is here that 
the work falls down a bit, as did the first edition. The table 
of contents might have been of some significant help in this 
regard if it had included all the questions for which answers 
are provided in the text (as the first edition did), but that 
degree of comprehensiveness has been eliminated this time 
out. A major portion of the work remains devoted, as was 
the case with the first edition, to lengthy exposition of pri- 
mary legal sources that, given today's ready Web access to 
such public domain sources, makes for a longer book than 
seems justified. 

Overall, though, Copyright in Cyberspace 2 continues 
the first edition's valuable contribution to the literature in 
the field, and combines a librarian's insight with a lawyer's 
knowledge to provide a useful addition to the shelves of 
librarians for whom the work is intended. Hoffman also 
commendably eschews the lawyer's tendency to overexplain 
or pontificate, and her accessible writing style results in a 
readable book on a difficult and technical subject. 

Rebecca Butler is not a lawyer by trade, but Copyright 
for Teachers and Librarians does not suffer from either a 
lack of authoritativeness or an overly academic approach. 
Indeed, it is obvious that it was written to be read rather 
than referred to. Butler has applied and adapted her work in 
developing copyright workshops, classes, and presentations 
to give the profession a useful primer on copyright that is 
accessible and likely to be useful on a daily basis. Working 
from essential concepts in part I, she brings the reader 
smoothly through clear discussion of fair use and licens- 
ing concepts to specific applications in various situations in 
part II. The table of contents and list of figures and charts 



are very good, and they resolve many of the problems that 
(again) a somewhat foreshortened index might have other- 
wise suggested. 

When dealing with specific situations and questions, in 
most respects Butler arrives at the same ultimate conclu- 
sions as Hoffman, but when, as is frequently the case, there 
is no clear answer, Butler tends to opt for the conservative 
view more often than Hoffman, usually recommending 
against copying, using, and so on, in the close cases. She 
provides her recommendations primarily through a series of 
flowcharts appearing frequently in each chapter of part II, 
with yes/no decision trees down the paths of which librar- 
ians may navigate. She supplements these decision tools 
with relevant contextual discussion, making the flowcharts 
much more useful to the discerning reader. 

Whether the copyright law, or any other law within 
the Anglo-American legal tradition, based as it is on court 
decisions and interpretations as opposed to statutory pre- 
scriptive absolutes, necessarily lends itself always to yes or 
no analysis is beside the point so long as the author's fre- 
quent caveats are observed. And, as previously mentioned, 
because Butler almost ineluctably leans to the conservative 
approach, librarians following their decision trees will rarely 
find themselves in a compromised or embarrassing position 
regarding use of the library's copyrighted materials. 

Copyright for Teachers and Librarians would perhaps 
benefit from some explication of possible and appropriate 
solutions to those copyright law dilemmas that lead librar- 
ians to take (and Butler often to recommend) a conservative 
approach. Certainly the library profession needs to be in 
the forefront of providing access to materials, thoughts, and 
ideas, not serving as surrogate copyright policemen intent 
on ensuring that materials are squirreled away for the use 
only of those who can pay each time for their use. Some 
sort of afterthoughts chapter, which would not disturb the 
otherwise helpful natural flow of the book, might have been 
useful in this regard. One would expect Butler's views in this 
regard to be incisive. 

So, is one view of the copyright elephant any more 
accurate than the other as far as the librarian is concerned? 
Happily, these books complement each other in many ways. 
For those looking for a straightforward, handbook type of 
approach to use in applying the mysterious ways of copy- 
right law to specific, everyday library situations, Copyright 
for Teachers and Librarians might be the better choice. 
But for more detail in respect to the ins and outs, and the 
benefits that can be derived from "further review" through 
examination in some detail of the statutory provisions and 
court decisions regarding copyright law, there is much to 
recommend in Copyright in Cyberspace 2. The reader of 
both will find the elephant most clearly in focus. — Vicki L. 
Gregory (Gregory@shell.cas.usf.edu), University of South 
Florida, Tampa 
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Putting XML to Work in the Library: Tools for 
Improving Access and Management. By Dick R. Miller 
and Kevin S. Clarke. Chicago: ALA, 2004. 205p. $45 ($40.50 
ALA members) paper (ISBN 0-8389-0863-2). 

Extensible Mark-up Language (XML) has been at the 
heart of many discussions for ten years and was sometimes 
introduced as a kind of miraculous panacea in outbursts of 
enthusiasm verging on the irrational. XML was not primarily 
designed for librarians, but quite early some librarians — 
especially those who had been considering replacing MARC 
formats with SGML-based formats — saw the potential it 
has for the profession. Some commentators, however, also 
expressed reservations about XMLs ability to deal with huge 
amounts of bibliographic data. The authors are both aware 
of XMLs limitations and convinced that XML can do a lot 
for us: "While XML cannot solve all of our problems, it does 
offer foundational tools to help transform the way libraries 
do business" (36). Indeed, it is almost a matter of survival, 
as the new environment — the Web environment — has pro- 
foundly transformed libraries' role and place within the soci- 
ety: "'Library information,' especially that in time-honored 
MARC formats and in proprietary integrated library system 
formats, has been segregated too long from mainstream Web 
resources. Having an online library catalog isn't good enough 
anymore" (37) and "Conditions for libraries have changed! . . 
. With instant information everywhere, libraries need to reas- 
sess their role and focus on strategies for mriving under the 
new circumstances" (96). Obsolescence and growing isola- 
tion are the major threats impending on libraries and librar- 
ians. The authors regard XML as a way to escape both. 

This book (written in 2002, published in 2004, and 
reviewed in 2005 for this 2006 issue of LRTS — it is impor- 
tant to keep that time aspect in mind) can be regarded as 
comprising two distinct sections. The first one (chapters 
1 and 2) is a presentation of XML itself and XML-related 
technologies (validation tools, linking tools, display tools, 
and so on), a kind of XML manual for librarians who are 
not acquainted with the mark-up language. The second one 
(chapters 3 to 5) is more library-specific and exposes how to 
develop — and put into practice — an XML-based metadata 
schema, with the potential to solve the many flaws that cata- 
loging rules and MARC formats are fraught with. 

Librarians who seek guidance for the development of 
their first XML Document Type Definition (DTD) or XML 
schema will find a step-by-step methodology that will prove 
extremely helpful on pages 94-96. Perhaps more important- 
ly, this section offers the authors an opportunity to express 
what they think of MARC21 and the Anglo-American 



Cataloguing Rules — and they do not think good things, 
to say the least. They develop seven arguments in favor 
of XML's superiority over MARC: coded values in fixed- 
length fields could be replaced with the flexible authority 
control enabled by XML; the inconsistent way dates are 
expressed in MARC could be unified in XML; redundancy 
and inconsistency in entering similar types of information 
could be avoided; MARC does not clearly separate infor- 
mation elements and information about them, which XML 
would make possible; relationships could be expressed in 
a unified way; MARC's complexity could be replaced with 
a core XML schema to which specialized information ele- 
ments could be added for certain types of materials (e.g., 
music, maps); and the MARC-8 character encoding system 
could be replaced with Unicode. According to the authors, 
XML could also help solve some of the problems posed by 
AACR: no clear identification of works; no consistent treat- 
ment of relationships; too much emphasis on transcription 
and description; use of mixed-language headings, which 
impedes internationalization; inconsistent treatment of ini- 
tial articles in titles, and so on. 

The audiors dien introduce die XML-based meta- 
data structure that they have developed at the Lane Medical 
Library, the XML Organic Bibliographic Information Schema 
(XOBIS) — in my opinion the biggest revolution in the cata- 
loging world since Cutter's time. XOBIS blurs the tradition- 
al — and cumbersome — border between bibliographic and 
authority records. It enables consistent treatment of biblio- 
graphic relationships and controlled use of qualifiers within 
headings. The authors insist, however, that XOBIS is "experi- 
mental," and that "it should not be interpreted as minimizing 
the problems such an undertaking [i.e., die replacement of 
MARC widi a Web-oriented schema] would entail" (144). It 
would be fascinating to see what a large-scale bibliographic 
database in XOBIS might look like. 

Unfortunately Chapter 4, which is devoted to die soft- 
ware and practical tools that would enable daring librarians 
to "put XML to work" in their library, is a bit disappointing. 
Not that it is not helpful, but it focuses almost exclusively on 
open source software that perform the following functions: 
edit XML documents; transform XML into other formats; 
display; store and index; or any combination of the above. Of 
course, such a publication could not and should not turn to 
a collection of advertisements for commercial systems and 
the vendors who supply diem, but it is not always possible 
to find the qualified staff able to adapt open source software 
for a libraiy's specific needs, and one has to be completely 
informed to make the good decision and the good choice. 

The fifth and final chapter, devoted to XML's poten- 
tial for the future, shows a number of the Lane Medical 
Libraiy's achievements that were made possible dianks to 
XML. Among other realizations, they maintain an online 
serials list from dieir catalog, they use XML "to assist with 
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the retrieval of information that sometimes gets buried in, 
or omitted from, the MARC format" (183), and they locate 
journal articles and so on. 

One of the many lessons that this highly valuable book 
has to offer is that lazy solutions consisting in just transfer- 
ring MARC fields into XML tags are not the best ones and 
do not put librarians in a position to envision the future with 
serenity. "Rather than just attempting to translate existing 
knowledge structures directly into XML, we have a strategic 
opportunity to redefine these structures in order to support 
future information systems. . . . We advocate the need for 
fundamental changes in order to achieve a viable replace- 
ment schema" (92, 93). Is the profession ready to follow 
pioneers and to invest time, thinking, and money in the 
XML revolution? — Patrick he Boeuf (patrick.le-boeuf@bnf 
fr), Bibliotheque nationale de France, Paris 

Who Wants Yesterday's Papers?: Essays on the Research 
Value of Printed Materials in the Digital Age. Yvonne 
Carignan et al., eds. Lanham, Md.: Scarecrow, 2005. 224p. 
$48 paper (ISBN 0-8108-5119-9). 

Have you ever noticed that when an issue comes to one's 
attention, permutations of it seem to arise at every turn? 
Recently, while listening during my daily commute to a CD 
educational series on classical music, I heard the instructor 
bemoan the belief of some scholars that fully 40 percent of 
the musical works of Johann Sebastian Bach — that would be 
in the range of 400 works — has been lost, and that the paper 
they were written on was likely used to wrap cheese or to 
provide insulation for the roots of plants and trees. 1 During 
the same period, in preparation for a family move, I sorted 
through the entire kindergarten through twelfth-grade aca- 
demic output of my two offspring and decided what among 
the hundreds of pages of math worksheets and other busy- 
work could be kept and what discarded. As I composed this 
review, I reflected diat loss and preservation are hardly new 
or unusual issues, but ones that have long pervaded both the 
public and private spheres. 

In 2001 Nicholson Baker, through the publication of 
his provocative Double Fold: Libraries and the Assault on 
Paper, created a flurry of attention on libraries' decisions 
to replace fragile or deteriorating collections of newspa- 
pers with microfilm. 2 Although Double Fold and its author 
have been widely criticized for arguably oversimplifying or 
obscuring the issues, they have served a valuable purpose by 
fostering a great deal of professional and public discussion of 
the issues surrounding the preservation of original materials 
in an increasingly digital age. Among the responses to Baker 
was a symposium organized by University of Maryland 
librarians and graduate students from the university's col- 
lege of information studies. 

While Double Fold focused on newspapers (and library 
card catalogs), the symposium's organizers had a far more 



ambitious objective, to address "die whole question of what 
original materials should be saved more broadly" (ix). The 
organizers' strategy, culminating in the publication of Who 
Wants Yesterday's Papers? was to promote a dialog between 
researchers from a range of academic disciplines — including 
humanities and the social and physical sciences — and the 
librarians and preservationists who face die daunting task of 
short-term decision-making and long-term planning in these 
areas. The book contains both die symposium presentations 
and further essays added to provide a more complete over- 
view of various aspects of die issue. The result provides a use- 
ful introduction to the complexities of the topic of the greatly 
differing research needs of varying disciplines. However, it 
also accomplishes much more by providing both a historical 
perspective on library preservation (primarily in part one, 
"The Race against Time") and an introduction to the techni- 
cal and other problems associated with digital preservation. 
Many of the essays are extensively footnoted, and the book 
also contains an annotated bibliography of sources for further 
research. AMiough there are current issues not explicitly 
dealt with due to the fact that the symposium took place in 
2002, such as developments in the area of government docu- 
ments and depository libraries and the Google digital initia- 
tives, Who Wants Yesterday's Papers? provides an excellent 
springboard for further exploration of the topic. 

Part two, titled "Digital Demand vs. Paper Pleas," 
explores the importance of original paper documents from 
the perspectives of University of Maryland professors in 
various disciplines. These scholars were asked what types of 
materials they used in their research; how important paper 
materials were, as opposed to microfilmed or digital materi- 
als; and whether and how their reliance on original materials 
had recently changed or would change in the foreseeable 
future. Science historian Stephen G. Brash's argument that 
old science textbooks, far from being outdated and useless, 
are crucially important takes on particular forces, consider- 
ing the ongoing debate over the teaching of evolution and 
intelligent design; it's difficult to imagine a better illustration 
of how textbooks as "social artifacts" (40) provide insight 
into the values and convictions of their respective eras. By 
contrast, physicist Jordan Goodman and archivist Kara M. 
McClurken explore the advent of online scholarly publish- 
ing, extolling its potential for rapid publication of scientific 
research results and enhanced methods of peer review, while 
touching upon copyright and preservation issues. They also 
address the problems reliance upon digital information 
pose for the researcher, describing how historian Michael 
Bellesiles was stripped of his academic position and awards 
after being unable to reproduce some of the research data 
used in writing his book Arming America: The Origins of a 
National Gun Culture. 3 John E. Newhagen's essay "Above 
the Fold: The Value of Paper Newspapers" presents several 
cogent examples of how "the preservation of the physical 
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form of yesterday's papers will be critical to preserving the 
meaning and content of the messages they embody" (87). 
A recent example is Laurel Leff's Buried by The Times: 
The Holocaust and America's Most Important Newspaper, 
in which Leff, a former newspaper journalist and current 
professor of journalism, exhaustively studied the New York 
Times's coverage of the campaign against European Jews 
from 1939 to 1945, analyzing how decisions about the con- 
tent and placement of news article on the genocide had the 
effect of minimizing public awareness and outrage. 4 

Part three, "Enduring Value," features the perspectives 
of a variety of preservation professionals and scholars seek- 
ing to explore the issues of how we should select materials 
for preservation and how to preserve them (and perhaps 
re-preserve them through reformatting) in the appropriate 
format for their audience, while taking into account the 
resources available for such preservation efforts. 

Richard J. Cox, of the School of Information Science 
at the University of Pittsburgh, who contributed the after- 
word to Who Wants Yesterday's Papers?, suggests that the 
answer to the question posed by the book's title is not, as the 
Rolling Stones concluded in their 1967 recording, "Nobody 
in the world," but rather that everyone wants them, and 
they want all of them. 5 According to Cox, the major error 
of such critics as Baker is the failure to recognize that selec- 
tion is an imperative, because not everything can be saved. 
Cox opines, "archivists and librarians have not sufficiently 
explained themselves or provided adequate reasons for 
some of their preservation activities. ... we cannot take 
for granted that we or our discipline will be understood or 
appreciated by external observers." 6 As a catalyst for dis- 
cussion of these important issues, Who Wants Yesterday's 
Papers? poses many more questions than it answers and 
provides avenues for further exploration and debate on 
the issues of the way society uses and values information. 
— Susan Herrick (sherrick@law.umaryland.edu), University 
of Maryland School of Law, Baltimore 
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Brief Reviews 

The Next Library Leadership: Attributes of Academic 
and Public Library Directors. By Peter Hernon, Ronald 
R. Powell, and Arthur P. Young. Westport, Conn.: Libraries 
Unlimited, 2003. 192p. $50 paper (ISBN 1-56308-992-0). 

Acknowledging the shortage of librarians entering the 
management ranks of academic and large public libraries, 
this book addresses a topic of great interest to the profes- 
sion and fills a need in the library literature. Beginning with 
a brief chapter supporting the claim that there is a shortage 
of librarians, the book continues with a literature review that 
provides a good introduction to the topic of leadership in 
libraries and makes some distinctions between the attributes 
needed by managers versus those needed by leaders. 

Three subsequent chapters list and analyze the quali- 
ties needed by Association of Research Libraries (ARL), 
Association of College and Research Libraries (ACRL), and 
public library directors. Each of these chapters treats the 
topic as a separate research study that begins with lists of 
potential attributes, a survey of leaders in the profession, and 
follow-up interviews of selected individuals. Whereas the 
lists of desired attributes arrived at through the surveys were 
agreed upon by those surveyed, there are clear differences of 
opinion when individuals are queried. The next few chapters 
compare the lists of attributes identified by ARL, ACRL, 
and public library directors, address tools that can be used to 
assess the leadership qualities and abilities of individuals, and 
review mediods for attaining leadership skills, such as work 
experience, leadership institutes, and mentoring, aMrough 
the authors do not endorse any particular medrod. Finally, a 
brief chapter concludes with a discussion on the use of head- 
hunters to identify and recruit potential leaders. 

Overall this book will be useful for librarians who would 
like to pursue leadership opportunities, and also for librar- 
ies that are recruiting for leadership positions. — Rebecca L. 
Mugridge (rlm31@psu.edu), Pennsylvania State University, 
University Park 

An Ounce of Prevention: Integrated Disaster Planning 
for Archives, Libraries, and Record Centres, 2nd ed. By 

Johanna Wellheiser and Jude Scott, with the assistance of 
John Barton. Lanham, Md.: Scarecrow; Toronto: Canadian 
Archives Foundation, 2002. 283p. $33 paper (ISBN 0-8108- 
4176-2). 

Wellheise and Scott have thoroughly revamped and 
expanded Wellheiser and Barton's 1985 first edition, an 
indispensable manual of its day. Retaining their Canadian- 
centric focus but encompassing a worldwide scope, the 
authors incorporate up-to-date approaches to disaster pre- 
vention, protection, preparedness, response, recovery, reha- 
bilitation (for collections, records, facilities, and systems), 
and post-disaster planning for water-damaged collections 
and records, including CDs and computer media as well as 
paper-based materials. 
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In the wake of damage from Hurricanes Katrina and 
Rita in 2005 on the United States Gulf Coast, disaster recov- 
ery rises to the forefront of many librarians' and archivists' 
concerns. The problem is that we are still talking, usually 
after the fact, about how to recover from a disaster, not 
how to protect collections and buildings before a disaster 
strikes, nor are we preparing to mitigate disaster. It takes 
money, time, and unrelenting commitment and attention to 
create, adopt, absorb, and practice disaster prevention and 
recovery. Once the adrenaline rush of the disaster is behind 
us, we complacently go back to life as usual until the next 
calamity. Most manuals deal with the best-case scenario of 
treating the damage within forty-eight hours to avoid mold, 
and An Ounce of Prevention espouses that approach. When 
the roof is gone, the staff has evacuated to another state, 
and whatever security forces in charge do not allow anyone 
to enter the area for weeks or months, not just hours, what 
hope is there for recovery of mold-damaged collections? 
The reality of catastrophic occurrences such as Katrina and 
Rita raises the question of the value of following instructions 
in a disaster recovery manual designed for a burst water pipe 
or leaky roof. Whole mindsets have to change so they do not 
fall back into the it-cannot-happen-to-me syndrome. 

An Ounce of Prevention lays out the groundwork in 
logical progression for librarians and archivists to start today 
to plan for all the stages of a disaster. Not only is it essential 
reading, but an absolute necessity to consult as a practical 
handbook while writing an institution's disaster plan. The 
detailed chapters go far beyond most published compila- 
tions of conference papers, which often focus on "how I 
did it in my shop." Wellheiser and Scott include two eye- 
opening tables dealing with the costs — per book and per 
box — of recovering and rehabilitating fire- and water-dam- 
aged books and archival documents. This book joins it prede- 
cessors — Sally Buchanan's Disaster Planning, Preparedness 
and Recovery for Libraries and Archives, Judith Fortson's 
Disaster Planning and Recovery, and Camila Alire's Library 
Disaster Planning and Recovery Handbook — in providing 
useful guidelines for pre- and post-disaster procedures. 1 
— Susan Hamburger (sxh36@psulias.psu.edu ), Pennsylvania 
State University, University Park 
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Out-of-Print and Special Collections Materials: 
Acquisition and Purchasing Options. The Acquisitions 



Librarian, no. 27. Ed. Judith Overmier. New York: Haworth, 
2002. $49.95 cloth (ISBN 0-7890-1674-5); $29.95 paper 
(ISBN 0-7890-1683-4). 

Guide to Out-of-Print Materials. ALCTS Acquisitions 
Guides, no. 12. By Narda Tafuri, Anna Seaberg, and 
Gary Handman. Lanham, Md.: Association for Library 
Collections & Technical Services, in cooperation with 
Scarecrow, 2004. 50p. $18.50 paper (ISBN 0-8108-4974-7). 

Despite the similarity of their titles, Out-of-Print and 
Special Collections Materials: Acquisitions and Purchasing 
Options and Guide to Out-of-Print Materials are two very 
different sorts of book, both in terms of purpose and integral 
quality. The former collects — or rather, amasses — a dozen 
articles of varying quality, from the intriguing to the banal. 
Adding to the confusion, the writers clearly had no shared 
vision of audience in mind. The uneven result is a haphazard 
assortment of articles, some of which — based on one's expe- 
rience and interest — may well be worth reading, and several 
of which should be ignored. 

Librarians new to collection development or acquisi- 
tions work may appreciate the primer-style contributions. 
Linda Fidler's "The Acquisition of Out-of-Print Music" 
clearly and succinctly reviews the vocabulary, formats, 
publishing landscape, and methods of print music acqui- 
sition. Stephen C. Wagner's "Acquiring Materials in the 
History of Science, Technology, and Medicine" performs 
similar service for its subject matter, but it also includes 
discussion of the development of collections policies and 
the selection of materials. Also of interest to new or new- 
to-rare-books librarians may be Marilyn Bailey Ogilvie's 
"Books from Abroad, One Collection Development 
Strategy." Focusing on the Western European trade, 
Ogilvie anecdotally shares common wisdom well-known 
to those experienced in the rare book world but that is 
seldom recorded in print for neophytes. 

Several articles offer exemplary approaches for collec- 
tion building, thoroughly describing a specific and perhaps 
unique collection, but in ways that may serve as models for 
others. In "Underground Poetiy, Collecting Poetiy, and the 
Librarian," Michael Basinski offers an extensive overview of 
underground poetry and its history, spelling out along the 
way implications for a special collection. In "Acquisitions in 
the James Ford Bell Library," Carol Urness details a true- 
to-life example of how collection scope and policies have 
been defined and renegotiated over time. Elaine M. Doak's 
"chat" titled "Decisions, Decisions, Decisions: A Tale of 
Special Collections in the Small Academic Library" offers 
another case study, one less exemplary perhaps only for its 
greater frankness. 

While the Purdue University Libraries' Books on 
Demand pilot project and subsequent studies of its results 
are both strong and strongly provocative, the content of 
Suzanne M. Ward's "Books on Demand: Just-in-Time 
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Acquisitions" may be found elsewhere in her co-authored 
articles. Collection development librarians should read this 
material in one of its iterations. 

Less helpful are the literature review articles. Editor 
Judith Overmier's own "Twenty Years of the Literature on 
Acquiring Out-of-Print Materials" unfortunately provides 
mere summary rather than a critical overview. Svetlana 
Korolev's "Gifts to a Science Academic Librarian" is little if 
anything more than a summary of others' work. 

Out of place in this book is John Alhouse's too-brief 
(approximately two pages) article, "Using Older Materials 
in Support of Teaching." Barbara Patterson's "Four Factors 
Influencing the Fair Market Value of Out-of-Print Books: 
Part 1" offers no new insights, and Overmier and Wallace 
Koehler's "Part 2" follow-up study provides only an example 
of what not to publish. Their own conclusion that their 
sample size is insufficient for reliable results neither fails to 
dissuade them from publishing their study, nor stops them 
from relying on this data for other conclusions. 

Unlike the former book, Guide to Out-of-Print Materials 
fulfills an explicit purpose, "to provide an overview of both 
the traditional and online resources available to the acquisi- 
tions librarian in locating and acquiring materials that are 
considered to be out of print" (1), covering resources related 



to books, videos, sound recordings, and serials. Presented in 
outline format, the guide provides a wealth of crucial infor- 
mation in concise, well-ordered snippets. Comprising mainly 
vendor contact and bibliographical resource information, its 
coverage with regard to English-language materials and the 
English-speaking world is excellent, and thus it deserves a 
place in any library acquisitions department. 

The strong value of the guide's information intensifies its 
greatest weakness — its print format limits its usefulness as a 
quick reference. The reader is bound by the categories into 
which the entries are sorted, with not even an index to aid in 
finding information regarding a specific vendor, Web site, or 
other entry. For example, while the Web sites of AddALL. 
com and Bookfinder are (correctly) classified as metasearch 
engines, this separates them from the listings for such close 
cousins (in practice) as Abebooks and Alibris. In addition, 
some Web addresses and other contact information, as 
might be expected, are already out of date. Librarians using 
this book will quickly long for an electronic format, which 
could be more easily searched and kept current (not to men- 
tion hotlinked). A must-read for those new to the area, the 
guide will yet hold value for even die most senior acquisi- 
tions librarian. — Darby Orcutt (darby_orcutt@ncsu.edu), 
North Carolina State University, Raleigh 
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